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Preface 


This book is an extended and revised version of my German book Grammatiktheorie 
(Müller 2013a). It introduces various grammatical theories that play a role in current 
theorizing or have made contributions in the past which are still relevant today. I explain 
some foundational assumptions and then apply the respective theories to what can be 
called the “core grammar” of German. I have decided to stick to the object language that 
I used in the German version of this book since many of the phenomena that will be 
dealt with cannot be explained with English as the object language. Furthermore, many 
theories have been developed by researchers with English as their native language and it 
is illuminative to see these theories applied to another language. I show how the theories 
under consideration deal with arguments and adjuncts, active/passive alternations, local 
reorderings (so-called scrambling), verb position, and fronting of phrases over larger 
distances (the verb second property of the Germanic languages without English). 

The second part deals with foundational questions that are important for developing 
theories. This includes a discussion of the question of whether we have innate domain 
specific knowledge of language (UG), the discussion of psycholinguistic evidence con- 
cerning the processing of language by humans, a discussion of the status of empty ele- 
ments and of the question whether we construct and perceive utterances holistically or 
rather compositionally, that is, whether we use phrasal or lexical constructions. The sec- 
ond part is not intended as a standalone book although the printed version of the book 
is distributed this way for technical reasons (see below). Rather it contains topics that 
are discussed again and again when frameworks are compared. So instead of attaching 
these discussions to the individual chapters they are organized in a separate part of the 
book. 

Unfortunately, linguistics is a scientific field with a considerable amount of termino- 
logical chaos. I therefore wrote an introductory chapter that introduces terminology in 
the way it is used later on in the book. The second chapter introduces phrase structure 
grammars, which plays a role for many of the theories that are covered in this book. I 
use these two chapters (excluding the Section 2.3 on interleaving phrase structure gram- 
mars and semantics) in introductory courses of our BA curriculum for German studies. 
Advanced readers may skip these introductory chapters. The following chapters are 
structured in a way that should make it possible to understand the introduction of the 
theories without any prior knowledge. The sections regarding new developments and 
classification are more ambitious: they refer to chapters still to come and also point to 
other publications that are relevant in the current theoretical discussion but cannot be 
repeated or summarized in this book. These parts of the book address advanced stu- 
dents and researchers. I use this book for teaching the syntactic aspects of the theories 


Preface 


in a seminar for advanced students in our BA. The slides are available on my web page. 
The second part of the book, the general discussion, is more ambitious and contains the 
discussion of advanced topics and current research literature. 

This book only deals with relatively recent developments. For a historical overview, 
see for instance Robins (1997); Jungen & Lohnstein (2006). I am aware of the fact that 
chapters on Integrational Linguistics (Lieb 1983; Eisenberg 2004; Nolda 2007), Optimality 
Theory (Prince & Smolensky 1993; Grimshaw 1997; G. Miiller 2000), Role and Reference 
Grammar (Van Valin 1993) and Relational Grammar (Perlmutter 1983; 1984) are missing. 
I will leave these theories for later editions. 

The original German book was planned to have 400 pages, but it finally was much 
bigger: the first German edition has 525 pages and the second German edition has 564 
pages. I added a chapter on Dependency Grammar and one on Minimalism to the English 
version and now the book has 849 pages. I tried to represent the chosen theories appro- 
priately and to cite all important work. Although the list of references is over 85 pages 
long, I was probably not successful. I apologize for this and any other shortcomings. 


Available versions of this book 


The canonical version of this book is the PDF document available from the Language 
Science Press webpage of this book’. This page also links to a Print on Demand version. 
Since the book is very long, we decided to split the book into two volumes. The first vol- 
ume contains the description of all theories and the second volume contains the general 
discussion. Both volumes contain the complete list of references and the indices. The 
second volume starts with page 443. The printed volumes are therefore identical to the 
parts of the PDF document. 
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Andreea Calude, Rong Chen, Matthew Czuba, Leonel de Alencar, Christian Dohler, 
Joseph T. Farquharson, Andreas Hölzl, Gianina Iordächioaia, Paul Kay, Anne Kilgus, 
Sandra Kübler, Timm Lichte, Antonio Machicao y Priemer, Michelle Natolo, Stephanie 
Natolo, Sebastian Nordhoff, Elizabeth Pankratz, Parviz Parsafar, Conor Pyle, Daniela 
Schröder, Eva Schultze-Berndt, Alec Shaw, Benedikt Singpiel, Anelia Stefanova, Neal 
Whitman, Viola Wiegand) that each worked on one or more chapters and really im- 
proved this book. I got more comments from every one of them than I ever got for a 
book done with a commercial publisher. Some comments were on content rather than 
on typos and layout issues. No proofreader employed by a commercial publisher would 
have spotted these mistakes and inconsistencies since commercial publishers do not have 
staff that knows all the grammatical theories that are covered in this book. 

During the past years, anumber of workshops on theory comparison have taken place. 
I was invited to three of them. I thank Helge Dyvik and Torbjorn Nordgard for inviting 
me to the fall school for Norwegian PhD students Languages and Theories in Contrast, 
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which took place 2005 in Bergen. Guido Mensching and Elisabeth Stark invited me to 
the workshop Comparing Languages and Comparing Theories: Generative Grammar and 
Construction Grammar, which took place in 2007 at the Freie Universitat Berlin and An- 
dreas Pankau invited me to the workshop Comparing Frameworks in 2009 in Utrecht. I 
really enjoyed the discussion with all participants of these events and this book benefited 
enormously from the interchange. 

I thank Peter Gallmann for the discussion of his lecture notes on GB during my time 
in Jena. The Sections 3.1.3-3.4 have a structure that is similar to the one of his script 
and take over a lot. Thanks to David Reitter for the KIEX macros for Combinatorial Cat- 
egorial Grammar, to Mary Dalrymple and Jonas Kuhn for the LFG macros and example 
structures, and to Laura Kallmeyer for the BIEN sources of most of the TAG analyses. 
Most of the trees have been adapted to the forest package because of compatibility is- 
sues with X4EIEX, but the original trees and texts were a great source of inspiration and 
without them the figures in the respective chapters would not be half as pretty as they 
are now. 

I thank Sašo Živanović for implementing the KIEX package forest. It really simpli- 
fies typesetting of trees, dependency graphs, and type hierarchies. I also thank him 
for individual help via email and on stackexchange. In general, those active on stack- 
exchange could not be thanked enough: most of my questions regarding specific de- 
tails of the typesetting of this book or the implementation of the KIEX classes that 
are used by Language Science Press now have been answered within several minutes. 
Thank you! Since this book is a true open access book under the CC-BY license, it can 
also be an open source book. The interested reader finds a copy of the source code at 
https://github.com/langsci/25. By making the book open source I pass on the knowledge 
provided by the DIEN gurus and hope that others benefit from this and learn to typeset 
their linguistics papers in nicer and/or more efficient ways. 

Viola Auermann and Antje Bahlke, Sarah Dietzfelbinger, Lea Helmers, and Chiara 
Jancke cannot be thanked enough for their work at the copy machines. Viola also helped 
a lot with proof reading prefinal stages of the translation. I also want to thank my (for- 
mer) lab members Felix Bildhauer, Philippa Cook, Janna Lipenkova, Jakob Maché, Bjarne 
Orsnes and Roland Schäfer, which were mentioned above already for other reasons, for 
their help with teaching. During the years from 2007 until the publication of the first 
German edition of this book two of the three tenured positions in German Linguistics 
were unfilled and I would have not been able to maintain the teaching requirements 
without their help and would have never finished the Grammatiktheorie book. 

I thank Tibor Kiss for advice in questions of style. His diplomatic way always was a 
shining example for me and I hope that this is also reflected in this book. 


On the way this book is published 


I started to work on my dissertation in 1994 and defended it in 1997. During the whole 
time the manuscript was available on my web page. After the defense, I had to look 
for a publisher. I was quite happy to be accepted to the series Linguistische Arbeiten 
by Niemeyer, but at the same time I was shocked about the price, which was 186.00 
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DM for a paperback book that was written and typeset by me without any help by the 
publisher (twenty times the price of a paperback novel). This basically meant that my 
book was depublished: until 1998 it was available from my web page and after this it was 
available in libraries only. My Habilitationsschrift was published by CSLI Publications 
for a much more reasonable price. When I started writing textbooks, I was looking for 
alternative distribution channels and started to negotiate with no-name print on demand 
publishers. Brigitte Narr, who runs the Stauffenburg publishing house, convinced me to 
publish my HPSG textbook with her. The copyrights for the German version of the book 
remained with me so that I could publish it on my web page. The collaboration was 
successful so that I also published my second textbook about grammatical theory with 
Stauffenburg. I think that this book has a broader relevance and should be accessible 
for non-German-speaking readers as well. I therefore decided to have it translated into 
English. Since Stauffenburg is focused on books in German, I had to look for another 
publisher. Fortunately the situation in the publishing sector changed quite dramatically 
in comparison to 1997: we now have high profile publishers with strict peer review that 
are entirely open access. I am very glad about the fact that Brigitte Narr sold the rights 
of my book back to me and that I can now publish the English version with Language 
Science Press under a CC-BY license. 


Language Science Press: scholar-owned high quality 
linguistic books 


In 2012 a group of people found the situation in the publishing business so unbearable 
that they agreed that it would be worthwhile to start a bigger initiative for publishing 
linguistics books in platinum open access, that is, free for both readers and authors. 
I set up a web page and collected supporters, very prominent linguists from all over 
the world and all subdisciplines and Martin Haspelmath and I then founded Language 
Science Press. At about the same time the DFG had announced a program for open access 
monographs and we applied (Miiller & Haspelmath 2013) and got funded (two out of 18 
applications got funding). The money is used for a coordinator (Dr. Sebastian Nordhoff) 
and an economist (Debora Siller), two programmers (Carola Fanselow and Dr. Mathias 
Schenner), who work on the publishing plattform Open Monograph Press (OMP) and on 
conversion software that produces various formats (ePub, XML, HTML) from our ÉTEX 
code. Svantje Lilienthal works on the documentation of OMP, produces screencasts and 
does user support for authors, readers and series editors. 

OMP is extended by open review facilities and community-building gamification tools 
(Müller 2012a; Müller & Haspelmath 2013). All Language Science Press books are re- 
viewed by at least two external reviewers. Reviewers and authors may agree to publish 
these reviews and thereby make the whole process more transparent (see also Pullum 
(1984) for the suggestion of open reviewing of journal articles). In addition there is an 
optional second review phase: the open review. This review is completely open to ev- 


? As a side remark: in the meantime Niemeyer was bought by de Gruyter and closed down. The price of the 
book is now 139.95 €/ $ 196.00. The price in Euro corresponds to 273.72 DM. 
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erybody. The whole community may comment on the document that is published by 
Language Science Press. After this second review phase, which usually lasts for two 
months, authors may revise their publication and an improved version will be published. 
This book was the first book to go through this open review phase. The annotated open 
review version of this book is still available via the web page of this book. 

Currently, Language Science Press has 17 series on various subfields of linguistics with 
high profile series editors from all continents. We have 18 published and 17 forthcoming 
books and 146 expressions of interest. Series editors and authors are responsible for 
delivering manuscripts that are typeset in BIFX, but they are supported by a web-based 
typesetting infrastructure that was set up by Language Science Press and by volunteer 
typesetters from the community. Proofreading is also community-based. Until now 53 
people helped improving our books. Their work is documented in the Hall of Fame: 
http://langsci-press.org/hallOfFame. 

If you think that textbooks like this one should be freely available to whoever wants 
to read them and that publishing scientific results should not be left to profit-oriented 
publishers, then you can join the Language Science Press community and support us 
in various ways: you can register with Language Science Press and have your name 
listed on our supporter page with almost 600 other enthusiasts, you may devote your 
time and help with proofreading and/or typesetting, or you may donate money for 
specific books or for Language Science Press in general. We are also looking for in- 
stitutional supporters like foundations, societies, linguistics departments or university 
libraries. Detailed information on how to support us is provided at the following web- 
page: http://langsci-press.org/supportUs. In case of questions, please contact me or the 
Language Science Press coordinator at contact@langsci-press.org. 


Berlin, March 11, 2016 Stefan Müller 


Foreword of the second edition 


The first edition of this book was published almost exactly two years ago. The book 
has app. 15,000 downloads and is used for teaching and in research all over the world. 
This is what every author and every teacher dreams of: distribution of knowledge and 
accessibility for everybody. The foreword of the first edition ends with a description of 
Language Science Press in 2016. This is the situation now: We have 324 expressions 
of interest and 58 published books. Books are published in 20 book series with 263 
members of editorial boards from 44 different countries from six continents. We have 
a total of 175,000 downloads. 138 linguists from all over the world have participated in 
proofreading. There are currently 296 proofreaders registered with Language Science 
Press. Language Science Press is a community-based publisher, but there is one person 
who manages everything: Sebastian Nordhoff. His position has to be paid. We were 
successful in acquiring financial support by almost 100 academic institutions including 
Harvard, the MIT, and Berkeley. If you want to support us by just signing the list 


3See http://userblogs.fu-berlin.de/langsci-press/2018/01/18/achievements-2017/ for the details and graphics. 
^A full list of supporting institutions is available here: http://langsci-press.org/knowledgeunlatched. 
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of supporters, by publishing with us, by helping as proofreader or by convincing your 
librarian/institution to support Language Science Press financially, please refer to http: 
//langsci-press.org/supportUs. 

After these more general remarks concerning Language Science Press I describe the 
changes I made for the second edition and I thank those who pointed out mistakes and 
provided feedback. 

I want to thank Wang Lulu for pointing out some typos that she found while translat- 
ing the book to Chinese. Thanks for both the typos and the translation. 

Fritz Hamm noticed that the definition of Intervention (see p. 138) was incomplete and 
pointed out some inconsistencies in translations of predicates in Section 2.3. I turned 
some straight lines in Chapter 3 into triangles and added a discussion of different ways 
to represent movement (see Figure 3.8 on p. 99). I now explain what SpecIP stands for 
and I added footnote 8 on SpecIP as label in trees. I extended the discussion of Pirahä in 
Section 13.1.8.2 and added lexical items that show that Pirahä-like modification without 
recursion can be captured in a straightforward way in Categorial Grammar. 

I reorganized the HPSG chapter to be in line with more recent approaches assuming 
the valence features spr and comps (Sag 1997; Müller 2019b) rather than a single valence 
feature. I removed the section on the LOCAL feature in Sign-based Construction Grammar 
(Section 10.6.2.2 in the first edition) since it was build on the wrong assumption that the 
filler would be identical to the representation in the valence specification. In Sag (2012: 
536) only the information in syn and sEM is shared. 

I added the example (60) on page 628 that shows a difference in choice of preposition 
in a prepositional object in Dutch vs. German. Since the publication of the first En- 
glish edition of the Grammatical Theory textbook I worked extensively on the phrasal 
approach to benefactive constructions in LFG (Asudeh, Giorgolo & Toivonen 2014). Sec- 
tion 21.2.2 was revised and adapted to what will be published as Müller (2018a). There 
is now a brief chapter on complex predicates in TAG and Categorial Grammar/HPSG 
(Chapter 22), that shows that valence-based approaches allow for an underspecification 
of structure. Valence is potential structure, while theories like TAG operate with actual 
structure. 

Apart from this I fixed several minor typos, added and updated some references and 
URLs. Thanks to Philippa Cook, Timm Lichte, and Antonio Machicao y Priemer for 
pointing out typos. Thanks to Leonel Figueiredo de Alencar, Francis Bond, John Carroll, 
Alexander Koller, Emily M. Bender, and Glenn C. Slayden for pointers to literature. Sašo 
Zivanovié helped adapting version 2.0 of the forest package so that it could be used 
with this large book. Iam very graceful for this nice tree typesetting package and all the 
work that went into it. 

The source code of the book and the version history is available on GitHub. Issues 
can be reported there: https://github.com/langsci/25. The book is also available on paper- 
hive, a platform for collective reading and annotation: https://paperhive.org/documents/ 
remote?type-langsci&id-25. It would be great if you would leave comments there. 


Berlin, 21st March 2018 Stefan Müller 
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Foreword of the third edition 


Since more and more researchers and students are using the book now, I get feedback 
that helps improve it. For the third edition I added references, expanded the discussion 
of the passive in GB (Section 3.4) a bit and fixed typos.” 

Chapter 4 contained figures from different chapters of Adger (2003). Adger introduces 
the DP rather late in the book and I had a mix of NPs and DPs in figures. I fixed this 
in the new edition. I am so used to talking about NPs that there were references to NP 
in the general discussion that should have been references to DP. I fixed this as well. I 
added a figure explaining the architecture in the Phase model of Minimalism and since 
the figures mention the concept of numeration, I added a footnote on numerations. I also 
added a figure depicting the architecture assumed in Minimalist theories with Phases 
(right figure in Figure 4.1). 

I thank Frank Van Eynde for pointing out eight typos in his review of the first edition. 
They have been fixed. He also pointed out that the placement of ARG-ST in the feature 
geometry of signs in HPSG did not correspond to Ginzburg & Sag (2000), where ARG-ST 
is on the top level rather than under car. Note that earlier versions of this book had arc- 
ST under cat and there had never been proper arguments for why it should not be there, 
which is why many practitioners of HPSG have kept it in that position (Müller 2018a). 
One reason to keep ARG-ST on the top level is that ARG-ST is appropriate for lexemes only. 
If ARG-ST is on the sign level, this can be represented in the type hierarchy: lexemes and 
word have an ARG-ST feature, phrases do not. If ARG-ST is on the car level, one would 
have to distinguish between cAT values that belong to lexemes and words on the one 
hand and phrasal caT values on the other hand, which would require two additional 
subtypes of the type cat. The most recent version of the computer implementation done 
in Stanford by Dan Flickinger has ARG-sT under LOCAL (2019-01-24). So, I was tempted 
to leave everything as it was in the second edition of the book. However, there is a real 
argument for not having ARG-ST under CAT. CAT is assumed to be shared in coordinations 
and cAT contains valence features for subjects and complements. The values of these 
valence features are determined by a mapping from ARG-ST. In some analyses, extracted 
elements are not mapped to the valence features and the same is sometimes assumed for 
omitted elements. To take an example consider (1): 


(1) He saw and helped the hikers. 


saw and helped are coordinated and the members in the valence lists have to be compati- 
ble. Now if one coordinates a ditransitive verb with one omitted argument with a strictly 
transitive verb, this would work under the assumption that the omitted argument is not 
part of the valence representation. But if ARG-ST is part of CAT, coordination would be 
made impossible since a three-place argument structure list would be incompatible with 
a two-place list. Hence I decided to change this in the third edition and represent ARG-ST 
outside of CAT from now on. 


5 A detailed list of issues and fixes can be found in the GitHub repository of this book at https://github.com/ 
langsci/25/. 
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I changed the section about Sign-Based Construction Grammar (SBCG) again. An 
argument about nonlocal dependencies and locality was not correct, since Sag (2012: 
166) does not share all information between filler and extraction side. The argument is 
now revised and presented as Section 10.6.2.3. Reviewing Miiller (2019c), Bob Borsley 
pointed out to me that the xAr feature is a way to circumvent locality restrictions that 
is actually used in SBCG. I added a footnote to the section on locality in SBCG. 

A brief discussion of Welke’s (2019) analysis of the German clause structure was added 
to the chapter about Construction Grammar (see Section 10.3). 

The analysis of a verb-second sentence in LFG is now part of the LFG chapter (Fig- 
ure 7.5 on page 242) and not just an exercise in the appendix. A new exercise was de- 
signed instead of the old one and the old one was integrated into the main text. 

I added a brief discussion of Osborne’s (2019) claim that Dependency Grammars are 
simpler than phrase structure grammars (p. 409). 

Geoffrey Pullum pointed out at the HPSG conference in 2019 that the label constraint- 
based may not be the best for the theories that are usually referred to with it. Changing 
the term in this work would require to change the title of the book. The label model 
theoretic may be more appropriate but some implementational work in HPSG and LFG 
not considering models may find the term inappropriate. I hence decided to stick to the 
established term. 

I followed the advice by Lisbeth Augustinus and added a preface to Part II of the book 
that gives the reader some orientation as to what to expect. 

I thank Mikhail Knyazev for pointing out to me that the treatment of V to I to C 
movement in the German literature differs from the lowering that is assumed for English 
and that some further references are needed in the chapter on Government & Binding. 

Working on the Chinese translation of this book, Wang Lulu pointed out some typos 
and a wrong example sentence in Chinese. Thanks for these comments! 

I thank Bob Borsley, Gisbert Fanselow, Hubert Haider and Pavel Logacev for discus- 
sion and Ina Baier for a mistake in a CG proof and Jonas Benn for pointing out some 
typos to me. Thanks to Tabea Reiner for a comment on gradedness. Thanks also to An- 
tonio Machicao y Priemer for yet another set of comments on the second edition and to 
Elizabeth Pankratz for proofreading parts of what I changed. 


Berlin, 15th August 2019 Stefan Miller 
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Part I 


Background and specific theories 


1 Introduction and basic terms 


The aim of this chapter is to explain why we actually study syntax (Section 1.1) and why 
it is important to formalize our findings (Section 1.2). Some basic terminology will be 
introduced in Sections 1.3-1.8: Section 1.3 deals with criteria for dividing up utterances 
into smaller units. Section 1.4 shows how words can be grouped into classes; that is I will 
introduce criteria for assigning words to categories such as verb or adjective. Section 1.5 
introduces the notion of heads, in Section 1.6 the distinction between arguments and ad- 
juncts is explained, Section 1.7 defines grammatical functions and Section 1.8 introduces 
the notion of topological fields, which can be used to characterize certain areas of the 
clause in languages such as German. 

Unfortunately, linguistics is a scientific field with a considerable amount of termino- 
logical chaos. This is partly due to the fact that terminology originally defined for certain 
languages (e.g., Latin, English) was later simply adopted for the description of other lan- 
guages as well. However, this is not always appropriate since languages differ from one 
another considerably and are constantly changing. Due to the problems caused by this, 
the terminology started to be used differently or new terms were invented. When new 
terms are introduced in this book, I will always mention related terminology or differing 
uses of each term so that readers can relate this to other literature. 


1.1 Why do syntax? 


Every linguistic expression we utter has a meaning. We are therefore dealing with what 
has been referred to as form-meaning pairs (de Saussure 1916b). A word such as tree in 
its specific orthographical form or in its corresponding phonetic form is assigned the 
meaning tree’. Larger linguistic units can be built up out of smaller ones: words can be 
joined together to form phrases and these in turn can form sentences. 

The question which now arises is the following: do we need a formal system which can 
assign a structure to these sentences? Would it not be sufficient to formulate a pairing 
of form and meaning for complete sentences just as we did for the word tree above? 

That would, in principle, be possible if a language were just a finite list of word se- 
quences. If we were to assume that there is a maximum length for sentences and a max- 
imum length for words and thus that there can only be a finite number of words, then 
the number of possible sentences would indeed be finite. However, even if we were to 
restrict the possible length of a sentence, the number of possible sentences would still be 
enormous. The question we would then really need to answer is: what is the maximum 
length of a sentence? For instance, it is possible to extend all the sentences in (1): 
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(1) a. This sentence goes on and on and on and on... 
b. [A sentence is a sentence] is a sentence. 


c. that Max thinks that Julius knows that Otto claims that Karl suspects that 
Richard confirms that Friederike is laughing 


In (1b), something is being said about the group of words a sentence is a sentence, namely 
that itis a sentence. One can, of course, claim the same for the whole sentence in (1b) and 
extend the sentence once again with is a sentence. The sentence in (1c) has been formed 
by combining that Friederike is laughing with that, Richard and confirms. The result of 
this combination is a new sentence that Richard confirms that Friederike is laughing. In 
the same way, this has then been extended with that, Karl and suspects. Thus, one obtains 
a very complex sentence which embeds a less complex sentence. This partial sentence in 
turn contains a further partial sentence and so on. (1c) is similar to those sets of Russian 
nesting dolls, also called matryoshka: each doll contains a smaller doll which can be 
painted differently from the one that contains it. In just the same way, the sentence in 
(1c) contains parts which are similar to it but which are shorter and involve different 
nouns and verbs. This can be made clearer by using brackets in the following way: 


(2) that Max thinks [that Julius knows [that Otto claims [that Karl suspects [that Rich- 
ard confirms [that Friederike is laughing]]]]] 


We can build incredibly long and complex sentences in the ways that were demonstrated 
in (1). 

It would be arbitrary to establish some cut-off point up to which such combinations 
can be considered to belong to our language (Harris 1957: 208; Chomsky 1957: 23). It is 
also implausible to claim that such complex sentences are stored in our brains as a single 
complex unit. While evidence from psycholinguistic experiments shows that highly fre- 
quent or idiomatic combinations are stored as complex units, this could not be the case 
for sentences such as those in (1). Furthermore, we are capable of producing utterances 
that we have never heard before and which have also never been uttered or written down 
previously. Therefore, these utterances must have some kind of structure, there must be 
patterns which occur again and again. As humans, we are able to build such complex 
structures out of simpler ones and, vice-versa, to break down complex utterances into 
their component parts. Evidence for humans’ ability to make use of rules for combining 
words into larger units has now also been provided by research in neuroscience (Pulver- 
müller 2010: 170). 


TIt is sometimes claimed that we are capable of constructing infinitely long sentences (Nowak, Komarova 
& Niyogi 2001: 117; Kim & Sells 2008: 3; Dan Everett in O'Neill & Wood (2012) at 25:19; Chesi 2015: 67; 
Martorell 2018: 2; Wikipedia entry of Biolinguistics, 2018-09-27) or that Chomsky made such claims (Leiss 
2003: 341). This is, however, not correct since every sentence has to come to an end at some point. Even 
in the theory of formal languages developed in the Chomskyan tradition, there are no infinitely long 
sentences. Rather, certain formal grammars can describe a set containing infinitely many finite sentences 
(Chomsky 1957: 13). See also Pullum & Scholz (2010) and Section 13.1.8 on the issue of recursion in grammar 
and for claims about the infinite nature of language. 
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It becomes particularly evident that we combine linguistic material in a rule-governed 
way when these rules are violated. Children acquire linguistic rules by generalizing from 
the input available to them. In doing so, they produce some utterances which they could 
not have ever heard previously: 


(3) Ich festhalte die. (Friederike, 2;6) 
I part.hold them 


Intended: ‘I hold them tight. 


Friederike, who was learning German, was at the stage of acquiring the rule for the 
position of the finite verb (namely, second position). What she did here, however, was 
to place the whole verb, including a separable particle fest ‘tight’, in the second position 
although the particle should be realized at the end of the clause (Ich halte die fest.). 

If we do not wish to assume that language is merely a list of pairings of form and mean- 
ing, then there must be some process whereby the meaning of complex utterances can 
be obtained from the meanings of the smaller components of those utterances. Syntax 
reveals something about the way in which the words involved can be combined, some- 
thing about the structure of an utterance. For instance, knowledge about subject-verb 
agreement helps with the interpretation of the following sentences in German: 


(4) a. DieFrau schläft. 
the woman sleep.3sG 


"Ihe woman sleeps: 


b. Die Madchen schlafen. 
the girls sleep.3PL 


‘The girls sleep! 
c. DieFrau kennt die Mädchen. 
the woman know.3sc the girls 


‘The woman knows the girls. 


d. DieFrau kennen die Mädchen. 
the woman know.3?r the girls 


"Ihe girls know the woman: 


The sentences in (4a,b) show that a singular or a plural subject requires a verb with the 
corresponding inflection. In (4a,b), the verb only requires one argument so the function 
of die Frau ‘the woman’ and die Madchen ‘the girls’ is clear. In (4c,d) the verb requires 
two arguments and die Frau ‘the woman’ and die Mddchen ‘the girls’ could appear in 
either argument position in German. The sentences could mean that the woman knows 
somebody or that somebody knows the woman. However, due to the inflection on the 
verb and knowledge of the syntactic rules of German, the hearer knows that there is 
only one available reading for (4c) and (4d), respectively. 

It is the role of syntax to discover, describe and explain such rules, patterns and struc- 
tures. 


1 Introduction and basic terms 


1.2 Why do it formally? 


The two following quotations give a motivation for the necessity of describing language 
formally: 


Precisely constructed models for linguistic structure can play an important role, 
both negative and positive, in the process of discovery itself. By pushing a precise 
but inadequate formulation to an unacceptable conclusion, we can often expose the 
exact source of this inadequacy and, consequently, gain a deeper understanding of 
the linguistic data. More positively, a formalized theory may automatically provide 
solutions for many problems other than those for which it was explicitly designed. 
Obscure and intuition-bound notions can neither lead to absurd conclusions nor 
provide new and correct ones, and hence they fail to be useful in two important 
respects. I think that some of those linguists who have questioned the value of 
precise and technical development of linguistic theory have failed to recognize the 
productive potential in the method of rigorously stating a proposed theory and 
applying it strictly to linguistic material with no attempt to avoid unacceptable 
conclusions by ad hoc adjustments or loose formulation. (Chomsky 1957: 5) 


As is frequently pointed out but cannot be overemphasized, an important goal of 
formalization in linguistics is to enable subsequent researchers to see the defects 
of an analysis as clearly as its merits; only then can progress be made efficiently. 
(Dowty 1979: 322) 


If we formalize linguistic descriptions, it is easier to recognize what exactly a particular 
analysis means. We can establish what predictions it makes and we can rule out alter- 
native analyses. A further advantage of precisely formulated theories is that they can 
be written down in such a way that computer programs can process them. When a the- 
oretical analysis is implemented as a computationally processable grammar fragment, 
any inconsistency will become immediately evident. Such implemented grammars can 
then be used to process large collections of text, so-called corpora, and they can thus 
establish which sentences a particular grammar cannot yet analyze or which sentences 
are assigned the wrong structure. For more on using computer implementation in lin- 
guistics see Bierwisch (1963: 163), Müller (1999b: Chapter 22) and Bender (2008b) as well 
as Section 3.6.2. 


1.3 Constituents 


If we consider the sentence in (5), we have the intuition that certain words form a unit. 


(5) Alle Studenten lesen während dieser Zeit Bücher. 
all students read during this time books 


‘All the students are reading books at this time? 


For example, the words alle ‘all’ and Studenten ‘students’ form a unit which says some- 
thing about who is reading. während ‘during’, dieser ‘this’ and Zeit ‘time’ also form a 
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unit which refers to a period of time during which the reading takes place, and Biicher 
‘books’ says something about what is being read. The first unit is itself made up of two 
parts, namely alle ‘all’ and Studenten ‘students’. The unit während dieser Zeit ‘during 
this time’ can also be divided into two subcomponents: während ‘during’ and dieser Zeit 
‘this time’. dieser Zeit ‘this time’ is also composed of two parts, just like alle Studenten 
‘all students’ is. 

Recall that in connection with (1c) above we talked about the sets of Russian nesting 
dolls (matryoshkas). Here, too, when we break down (5) we have smaller units which are 
components of bigger units. However, in contrast to the Russian dolls, we do not just 
have one smaller unit contained in a bigger one but rather, we can have several units 
which are grouped together in a bigger one. The best way to envisage this is to imagine 
a system of boxes: one big box contains the whole sentence. Inside this box, there are 
four other boxes, which each contain alle Studenten ‘all students’, lesen ‘reads’, während 
dieser Zeit ‘during this time’ and Bücher ‘books’, respectively. Figure 1.1 illustrates this. 


Figure 1.1: Words and phrases in boxes 


In the following section, I will introduce various tests which can be used to show how 
certain words seem to “belong together” more than others. When I speak of a word se- 
quence, I generally mean an arbitrary linear sequence of words which do not necessarily 
need to have any syntactic or semantic relationship, e.g., Studenten lesen während ‘stu- 
dents read during’ in (5). A sequence of words which form a structural entity, on the 
other hand, is referred to as a phrase. Phrases can consist of words as in this time or of 
combinations of words with other phrases as in during this time. The parts of a phrase 
and the phrase itself are called constituents. So all elements that are in a box in Figure 1.1 
are constituents of the sentence. 

Following these preliminary remarks, I will now introduce some tests which will help 
us to identify whether a particular string of words is a constituent or not. 


1.3.1 Constituency tests 


There are a number of ways to test the constituent status of a sequence of words. In the 
following subsections, I will present some of these. In Section 1.3.2, we will see that there 
are cases when simply applying a test “blindly” leads to unwanted results. 


1.3.1.1 Substitution 


If it is possible to replace a sequence of words in a sentence with a different sequence 
of words and the acceptability of the sentence remains unaffected, then this constitutes 
evidence for the fact that each sequence of words forms a constituent. 
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In (6), den Mann ‘the man’ can be replaced by the string eine Frau ‘a woman’. This is 
an indication that both of these word sequences are constituents. 


(6) a. Erkennt [den Mann]. 
he knows the man 
‘He knows the man? 


b. Er kennt [eine Frau]. 
he knows a woman 


‘He knows a woman. 


Similary, in (7a), the string das Buch zu lesen ‘the book to read’ can be replaced by dem 
Kind das Buch zu geben ‘the woman the book to give’. 


(7) a. Er versucht, [das Buch zu lesen]. 
he tries the book to read 
‘He is trying to read the book’ 


b. Er versucht, [dem Kind das Buch zu geben]. 
he tries the child the book to give 


‘He is trying to give the child the book? 


This test is referred to as the substitution test. 


1.3.1.2 Pronominalization 


Everything that can be replaced by a pronoun forms a constituent. In (8), one can for 
example refer to der Mann ‘the man’ with the pronoun er ‘he’: 


(8 a. [Der Mann] schláft. 
the man sleeps 
"Ihe man is sleeping’ 
b. Er schláft. 

he sleeps 

‘He is sleeping: 
It is also possible to use a pronoun to refer to constituents such as das Buch zu lesen ‘the 
book to read’ in (7a), as is shown in (9): 


(9) a. Peter versucht, [das Buch zu lesen]. 
Peter tries the book to read 
‘Peter is trying to read the book? 


b. Klaus versucht das auch. 
Klaus tries that also 


‘Klaus is trying to do that as well! 


The pronominalization test is another form of the substitution test. 
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1.3.1.3 Question formation 


A sequence of words that can be elicited by a question forms a constituent: 


(10) a. [Der Mann] arbeitet. 
the man works 
‘The man is working. 
b. Wer arbeitet? 
who works 


“Who is working?’ 


Question formation is a specific case of pronominalization. One uses a particular type 
of pronoun (an interrogative pronoun) to refer to the word sequence. 

Constituents such as das Buch zu lesen in (7a) can also be elicited by questions, as (11) 
shows: 


(11) Was versucht er? 
what tries he 


“What does he try?’ 


1.3.1.4 Permutation test 


If a sequence of words can be moved without adversely affecting the acceptability ofthe 
sentence in which it occurs, then this is an indication that this word sequence forms a 
constituent. 

In (12), keiner ‘nobody’ and dieses Kind ‘this child’ exhibit different orderings, which 
suggests that dieses ‘this’ and Kind ‘child’ belong together. 


(12) a. dasskeiner [dieses Kind] kennt 
that nobody this child knows 


b. dass [dieses Kind] keiner kennt 
that this child nobody knows 


“that nobody knows this child’ 
On the other hand, it is not plausible to assume that keiner dieses 'nobody this’ forms a 


constituent in (12a). If we try to form other possible orderings by trying to move keiner 
dieses ‘nobody this’ as a whole, we see that this leads to unacceptable results:? 


(13) a. "dass Kind keiner dieses kennt 
that child nobody this knows 


* 


?] use the following notational conventions for all examples: ** indicates that a sentence is ungrammatical, 
"P denotes that the sentence has a reading which differs from the intended one and finally ‘§’ should be 
understood as a sentence which is deviant for semantic or information-structural reasons, for example, 
because the subject must be animate, but is in fact inanimate in the example in question, or because there 
is a conflict between constituent order and the marking of given information through the use of pronouns. 
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b. *dass Kind kennt keiner dieses 
that child knows nobody this 


Furthermore, constituents such as das Buch zu lesen ‘to read the book’ in (7a) can be 
moved: 


(14) a. Er hat noch nicht [das Buch zu lesen] versucht. 
he has PART not the book to read tried 


"He has not yet tried to read the book 


b. Er hat [das Buch zu lesen] noch nicht versucht. 
he has the book to read PART not tried 


c. Er hat noch nicht versucht, [das Buch zu lesen]. 
he has PART not tried the book to read 


1.3.1.5 Fronting 


Fronting is a further variant of the movement test. In German declarative sentences, 
only a single constituent may normally precede the finite verb: 


(15 a. [Alle Studenten] lesen während der vorlesungsfreien Zeit Bücher. 
all students read.zpı during the lecture.free time books 


“All students read books during the semester break. 


b. [Bücher] lesen alle Studenten während der vorlesungsfreien Zeit. 
books read all students during the lecture.free time 


c. * [Alle Studenten] [Bücher] lesen während der vorlesungsfreien Zeit. 
all students books read during the lecture.free time 


d. * [Bücher] [alle Studenten] lesen während der vorlesungsfreien Zeit. 
books all students read during the lecture.free time 


The possibility for a sequence of words to be fronted (that is to occur in front of the finite 
verb) is a strong indicator of constituent status. 


1.3.1.6 Coordination 


If two sequences of words can be conjoined then this suggests that each sequence forms 
a constituent. 

In (16), der Mann ‘the man’ and die Frau ‘the woman’ are conjoined and the entire 
coordination is the subject of the verb arbeiten ‘to work’. This is a good indication of the 
fact that der Mann and die Frau each form a constituent. 


(16) [Der Mann] und [die Frau] arbeiten. 
the man and the woman work.3PL 


“Ihe man and the woman work. 
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The example in (17) shows that phrases with to-infinitives can be conjoined: 


(17) Er hat versucht, [das Buch zu lesen] und [es dann unauffällig verschwinden zu 
he had tried the book to read and it then secretly disappear to 
lassen]. 
let 


‘He tried to read the book and then make it quietly disappear?’ 


1.3.2 Some comments on the status of constituent tests 


It would be ideal if the tests presented here delivered clear-cut results in every case, as 
the empirical basis on which syntactic theories are built would thereby become much 
clearer. Unfortunately, this is not the case. There are in fact a number of problems with 
constituent tests, which I will discuss in what follows. 


1.3.2.1 Expletives 


There is a particular class of pronouns - so-called expletives - which do not denote peo- 
ple, things, or events and are therefore non-referential. An example of this is es ‘it’ in 
(18). 
(18) a. Es regnet. 
it rains 
‘It is raining? 
b. Regnet es? 
rains it 
‘Is it raining?’ 
C. dass es jetzt regnet 
that it now rains 
‘that it is raining now’ 
As the examples in (18) show, es can either precede the verb, or follow it. It can also be 
separated from the verb by an adverb, which suggests that es should be viewed as an 
independent unit. 


Nevertheless, we observe certain problems with the aforementioned tests. Firstly, es 
‘it’ is restricted with regard to its movement possibilities, as (19a) and (20b) show. 


(19) a. * dass jetzt es regnet 
that now it rains 
Intended: 'that it is raining now' 


b. dass jetzt keiner klatscht 
that now nobody claps 


‘that nobody is clapping now’ 
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(20) a. Er sah es regnen. 
he saw it.Acc rain 


‘He saw that it was raining’ 


b. *Es sah er regnen. 
it.Acc saw he rain 


Intended: ‘he saw that it was raining’ 


c. Er sah einen Mann klatschen. 
he saw a.acc man clap 


‘He saw a man clapping’ 


d. Einen Mann sah er klatschen. 
a.Acc man saw he clap 


‘A man, he saw clapping?’ 


Unlike the accusative object einen Mann ‘a man’ in (20c,d), the expletive in (20b) cannot 
be fronted. 
Secondly, substitution and question tests also fail: 


(21 a. "Der Mann / er regnet. 
the man he rains 


b. "Wer / was regnet? 
who what rains 


Similarly, the coordination test cannot be applied either: 


(22) “Es und der Mann regnet / regnen. 
it and the man rains rain 


The failure of these tests can be easily explained: weakly stressed pronouns such as es are 
preferably placed before other arguments, directly after the conjunction (dass in (18c)) 
and directly after the finite verb in (20a) (see Abraham 1995: 570). If an element is placed 
in front of the expletive, as in (19a), then the sentence is rendered ungrammatical. The 
reason for the ungrammaticality of (20b) is the general ban on accusative es appearing 
in clause-initial position. Although such cases exist, they are only possible if es ‘it’ is 
referential (Lenerz 1994: 162; Gartner & Steinbach 1997: 4). 

The fact that we could not apply the substitution and question tests is also no longer 
mysterious as es is not referential in these cases. We can only replace es 'it' with another 
expletive such as das ‘that’. If we replace the expletive with a referential expression, we 
derive a different semantic interpretation. It does not make sense to ask about something 
semantically empty or to refer to it with a pronoun. 

It follows from this that not all of the tests must deliver a positive result for a se- 
quence of words to count as a constituent. That is, the tests are therefore not a necessary 
requirement for constituent status. 
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1.3.2.2 Movement 


The movement test is problematic for languages with relatively free constituent order, 
since it is not always possible to tell what exactly has been moved. For example, the 
string gestern dem Mann ‘yesterday the man’ occupies different positions in the following 
examples: 


(23) a. weil keiner gestern dem Mann geholfen hat 
because nobody yesterday the man helped has 


‘because nobody helped the man yesterday’ 


b. weil gestern dem Mann keiner geholfen hat 
because yesterday the man nobody helped has 


‘because nobody helped the man yesterday’ 


One could therefore assume that gestern ‘yesterday’ and dem Mann ‘the man’, which of 
course do not form a constituent, have been moved together. An alternative explanation 
for the ordering variants in (23) is that adverbs can occur in various positions in the 
clause and that only dem Mann ‘the man’ has been moved in front of keiner ‘nobody’ in 
(23b). In any case, it is clear that gestern and dem Mann have no semantic relation and 
that it is impossible to refer to both of them with a pronoun. Although it may seem at 
first glance as if this material had been moved as a unit, we have seen that it is in fact 
not tenable to assume that gestern dem Mann ‘yesterday the man’ forms a constituent. 


1.3.2.3 Fronting 


As mentioned in the discussion of (15), the position in front of the finite verb is normally 
occupied by a single constituent. The possibility for a given word sequence to be placed 
in front of the finite verb is sometimes even used as a clear indicator of constituent status, 
and even used in the definition of Satzglied?. An example of this is taken from Bußmann 
(1983), but is no longer present in Bußmann (1990): 


Satzglied test A procedure based on — topicalization used to analyze complex con- 
stituents. Since topicalization only allows a single constituent to be moved to the 
beginning of the sentence, complex sequences of constituents, for example adverb 
phrases, can be shown to actually consist of one or more constituents. In the ex- 
ample Ein Taxi quält sich im Schrittempo durch den Verkehr ‘A taxi was struggling 
at walking speed through the traffic’, im Schrittempo ‘at walking speed’ and durch 
den Verkehr ‘through the traffic’ are each constituents as both can be fronted inde- 
pendently of each other. (Bußmann 1983: 446) 


? Satzglied is a special term used in grammars of German, referring to a constituent on the clause level 
(Eisenberg et al. 2005: 783). 

^The original formulation is: Satzgliedtest [Auch: Konstituententest]. Auf der > Topikalisierung beruhen- 
des Verfahren zur Analyse komplexer Konstituenten. Da bei Topikalisierung jeweils nur eine Konstituente 
bzw. ein — Satzglied an den Anfang gerückt werden kann, lassen sich komplexe Abfolgen von Kon- 
stituenten (z. B. Adverbialphrasen) als ein oder mehrere Satzglieder ausweisen; in Ein Taxi quált sich im 
Schrittempo durch den Verkehr sind im Schrittempo und durch den Verkehr zwei Satzglieder, da sie beide 
unabhängig voneinander in Anfangsposition gerückt werden können. 
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The preceding quote has the following implications: 


« Some part of a piece of linguistic material can be fronted independently — 
This material does not form a constituent. 


e Linguistic material can be fronted together — 
This material forms a constituent. 


It will be shown that both of these prove to be problematic. 
The first implication is cast into doubt by the data in (24): 


(24) a. Keine Einigung erreichten Schröder und Chirac über den Abbau der 
no agreement reached Schröder and Chirac about the reduction of.the 
Agrarsubventionen.? 
agricultural.subsidies 
‘Schröder and Chirac could not reach an agreement on the reduction of agri- 
cultural subsidies: 

b. [Über den Abbau der  Agrarsubventionen] erreichten Schróder und 
about the reduction of.the agricultural.subsidies reached Schröder and 
Chirac keine Einigung. 
Chiracno agreement 


Although parts of the noun phrase keine Einigung über den Abbau der Agrarsubventionen 
‘no agreement on the reduction of agricultural subsidies’ can be fronted individually, we 
still want to analyze the entire string as a noun phrase when it is not fronted as in (25): 


(25) Schröder und Chirac erreichten [keine Einigung über den Abbau der 
Schröder and Chirac reached no agreement about the reduction of.the 
Agrarsubventionen]. 
agricultural.subsidies 


The prepositional phrase über den Abbau der Agrarsubventionen ‘on the reduction of 
agricultural subsidies’ is semantically dependent on Einigung ‘agreement’ cf. (26): 


(26) Sie einigen sich über die Agrarsubventionen. 
they agree REFL about the agricultural.subsidies 


“They agree on the agricultural subsidies: 
This word sequence can also be fronted together: 


(27) [Keine Einigung über den Abbau der Agrarsubventionen] erreichten 
no agreement about the reduction of.the agricultural.subsidies reached 
Schröder und Chirac. 

Schröder and Chirac 


In the theoretical literature, it is assumed that keine Einigung über den Abbau der Agrar- 
subventionen forms a constituent which can be “split up” under certain circumstances. 


Stagesschau, 15.10.2002, 20:00. 
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In such cases, the individual subconstituents can be moved independently of each other 
(De Kuthy 2002) as we have seen in (25). 
The second implication is problematic because of examples such as (28): 


(28) a. [Trocken] [durch die Stadt] kommt man am Wochenende auch mit der 
dry through the city comes one at.the weekend also with the 
BVG.° 
BVG 


"With the BVG, you can be sure to get around town dry at the weekend’ 


b. [Wenig] [mit Sprachgeschichte] hat der dritte Beitrag in dieser Rubrik 
little withlanguage.history has the third contribution in this section 
zu tun, [...]’ 
to do 


‘The third contribution in this section has little to do with language history’ 


In (28), there are multiple constituents preceding the finite verb, which bear no obvious 
syntactic or semantic relation to each other. Exactly what is meant by a “syntactic or 
semantic relation” will be fully explained in the following chapters. At this point, I will 
just point out that in (28a) the adjective trocken ‘dry’ has man ‘one’ as its subject and 
furthermore says something about the action of ‘travelling through the city’. That is, it 
refers to the action denoted by the verb. As (29b) shows, durch die Stadt ‘through the 
city’ cannot be combined with the adjective trocken ‘dry’. 


(29) a. Man ist / bleibt trocken. 
one is stays dry 


“One is/stays dry. 


b. * Man ist / bleibt trocken durch die Stadt. 
one is stays dry through the city 


Therefore, the adjective trocken ‘dry’ does not have a syntactic or semantic relationship 
with the prepositional phrase durch die Stadt ‘through the city’. Both phrases have in 
common that they refer to the verb and are dependent on it. 

One may simply wish to treat the examples in (28) as exceptions. This approach would, 
however, not be justified, as I have shown in an extensive empirical study (Miiller 2003a). 

If one were to classify trocken durch die Stadt as a constituent due to it passing the 
fronting test, then one would have to assume that trocken durch die Stadt in (30) is also 
a constituent. In doing so, we would devalue the term constituent as the whole point 
of constituent tests is to find out which word strings have some semantic or syntactic 
relationship.” 


5taz berlin, 10.07.1998, p. 22. 

7 Zeitschrift für Dialektologie und Linguistik, LXIX, 3/2002, p. 339. 

"These data can be explained by assuming a silent verbal head preceding the finite verb and thereby en- 
suring that there is in fact just one constituent in initial position in front of the finite verb (Müller 2005c; 
2017a). Nevertheless, this kind of data are problematic for constituent tests since these tests have been 
specifically designed to tease apart whether strings such as trocken and durch die Stadt or wenig and mit 
Sprachgeschichte in (30) form a constituent. 
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(30) a. Man kommt am Wochenende auch mit der BVG trocken durch die 
one comes at.the weekend also with the BVG dry through the 
Stadt. 
city 
"With the BVG, you can be sure to get around town dry at the weekend’ 


b. Der dritte Beitrag in dieser Rubrik hat wenig mit Sprachgeschichte zu 
the third contribution in this section has little with language.history to 
tun. 


do 

"Ihe third contribution in this section has little to do with language history: 
The possibility for a given sequence of words to be fronted is therefore not a sufficient 
diagnostic for constituent status. 


We have also seen that it makes sense to treat expletives as constituents despite the 
fact that the accusative expletive cannot be fronted (cf. (20a)): 


(31 a. Erbringtes bis zum Professor. 
he brings ExPL until to.the professor 
‘He makes it to professor? 
b. #Es bringt er bis zum Professor. 


it brings he until to.the professor 


There are other elements that can also not be fronted. Inherent reflexives are a good 
example of this: 


(32) a. Karl hat sich nicht erholt. 
Karl has REFL not recovered 


‘Karl hasn’t recovered.’ 


b. * Sich hat Karl nicht erholt. 
REFL has Karl not recovered 


It follows from this that fronting is not a necessary criterion for constituent status. There- 
fore, the possibility for a given word string to be fronted is neither a necessary nor suf- 
ficient condition for constituent status. 


1.3.2.4 Coordination 


Coordinated structures such as those in (33) also prove to be problematic: 


(33) Deshalb kaufte der Mann einen Esel und die Frau ein Pferd. 
therefore bought the man a donkey and the woman a horse 
“Therefore, the man bought a donkey and the woman a horse! 
At first glance, der Mann einen Esel ‘the man a donkey’ and die Frau ein Pferd ‘the woman 


a horse’ in (33) seem to be coordinated. Does this mean that der Mann einen Esel and die 
Frau ein Pferd each form a constituent? 
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As other constituent tests show, this assumption is not plausible. This sequence of 
words cannot be moved together as a unit:” 


(34) “Der Mann einen Esel kaufte deshalb. 
the man a donkey bought therefore 


Replacing the supposed constituent is also not possible without ellipsis: 


(35) a. #Deshalb kaufte er. 
therefore bought he 


b. * Deshalb kaufte ihn. 
therefore bought him 


The pronouns do not stand in for the two logical arguments of kaufen ‘to buy’, which 
are realized by der Mann ‘the man’ and einen Esel ‘a donkey’ in (33), but rather for one 
in each. There are analyses that have been proposed for examples such as (33) in which 
two verbs kauft ‘buys’ occur, where only one is overt, however (Crysmann 2008). The 
example in (33) would therefore correspond to: 


(36) Deshalb kaufte der Mann einen Esel und kaufte die Frau ein Pferd. 
therefore bought the man a donkey and bought the woman a horse 


This means that although it seems as though der Mann einen Esel ‘the man a donkey’ 
and die Frau ein Pferd ‘the woman a horse’ are coordinated, it is actually kauft der Mann 
einen Esel ‘buys the man a donkey’ and (kauft) die Frau ein Pferd ‘buys the woman a 
horse’ which are conjoined. 

We should take the following from the previous discussion: even when a given word 
sequence passes certain constituent tests, this does not mean that one can automatically 
infer from this that we are dealing with a constituent. That is, the tests we have seen are 
not sufficient conditions for constituent status. 

Summing up, it has been shown that these tests are neither sufficient nor necessary 
for attributing constituent status to a given sequence of words. However, as long as one 
keeps the problematic cases in mind, the previous discussion should be enough to get 
an initial idea about what should be treated as a constituent. 


1.4 Parts of speech 


The words in (37) differ not only in their meaning but also in other respects. 


(37) Der große Biber schwimmt jetzt. 
the big beaver swims now 


‘The big beaver swims now. 


"The area in front of the finite verb is also referred to as the Vorfeld ‘prefield’ (see Section 1.8). Apparent 
multiple fronting is possible under certain circumstances in German. See the previous section, especially 
the discussion of the examples in (28) on page 15. The example in (34) is created in such a way that the 
subject is present in the prefield, which is not normally possible with verbs such as kaufen ‘to buy’ for rea- 
sons which have to do with the information-structural properties of these kinds of fronting constructions. 
Compare also De Kuthy & Meurers 2003b on subjects in fronted verb phrases and Bildhauer & Cook 2010: 
72 on frontings of subjects in apparent multiple frontings. 
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Each of the words is subject to certain restrictions when forming sentences. It is common 
practice to group words into classes with other words which share certain salient prop- 
erties. For example, der ‘the’ is an article, Biber ‘beaver’ is a noun, schwimmt ‘swims’ is 
a verb and jetzt ‘now’ is an adverb. As can be seen in (38), it is possible to replace all the 
words in (37) with words from the same word class. 


(38) Die kleine Raupe frisst immer. 
the small caterpillar eats always 


‘The small caterpillar is always eating. 


This is not always the case, however. For example, it is not possible to use a verb such 
as verschlingt ‘devours’ or the second-person form schwimmst in (38). This means that 
the categorization of words into parts of speech is rather coarse and that we will have to 
say a lot more about the properties of a given word. In this section, I will discuss various 
word classes/parts of speech and in the following sections I will go into further detail 
about the various properties which characterize a given word class. 

The most important parts of speech are verbs, nouns, adjectives, prepositions and ad- 
verbs. In earlier decades, it was common among researchers working on German (see 
also Section 11.6.1 on Tesniére’s category system) to speak of action words, describing 
words, and naming words. These descriptions prove problematic, however, as illustrated 
by the following examples: 


(39) a. die Idee 
the idea 


b. die Stunde 
the hour 


c. das laute Sprechen 
the loud speaking 


“(the act of) speaking loudly’ 
d Die Erörterung der Lage dauerte mehrere Stunden. 
the discussion of.the situation lasted several hours 


“The discussion of the situation lasted several hours: 


(39a) does not describe a concrete entity, (39b) describes a time interval and (39c) and 
(39d) describe actions. It is clear that Idee ‘idea’, Stunde ‘hour’, Sprechen ‘speaking’ and 
Erörterung ‘discussion’ differ greatly in terms of their meaning. Nevertheless, these 
words still behave like Raupe ‘caterpillar’ and Biber ‘beaver’ in many respects and are 
therefore classed as nouns. 

The term action word is not used in scientific linguistic work as verbs do not always 
need to denote actions: 


(40) a. Ihm gefällt das Buch. 
him pleases the book 


‘He likes the book’ 
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b. Das Eis schmilzt. 
the ice melts 
"Ihe ice is melting: 

c. Es regnet. 
it rains 
‘It is raining? 

One would also have to class the noun Erérterung ‘discussion’ as an action word. 
Adjectives do not always describe properties of objects. In the following examples, 


the opposite is in fact true: the characteristic of being a murderer is expressed as being 
possible or probable, but not as being true properties of the modified noun. 


(41) a. der mutmaßliche Mörder 
the suspected murderer 


b. Soldaten sind potenzielle Mérder. 
soldiers are potential murderers 


The adjectives themselves in (41) do not actually provide any information about the char- 
acteristics of the entities described. One may also wish to classify lachende ‘laughing’ in 
(42) as an adjective. 


(42) der lachende Mann 
the laughing man 


If, however, we are using properties and actions as our criteria for classification, lachend 
‘laughing’ should technically be an action word. 

Rather than semantic criteria, it is usually formal criteria which are used to determine 
word classes. The various forms a word can take are also taken into account. So lacht 
‘laughs’, for example, has the forms given in (43). 


(43) a. Ich lache. 
I laugh 


b. Du  lachst. 
you.sc laugh 

c. Er lacht. 
he laughs 

d. Wir lachen. 
we laugh 

e. Ihr lacht. 
you.pL laugh 


f. Sie lachen. 
they laugh 
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In German, there are also forms for the preterite, imperative, present subjunctive, past 
subjunctive and non-finite forms (participles and infinitives with or without zu ‘to’). All 
of these forms constitute the inflectional paradigm of a verb. Tense (present, preterite, 
future), mood (indicative, subjunctive, imperative), person (1st, 2nd, 3rd) and number 
(singular, plural) all play a role in the inflectional paradigm. Certain forms can coincide 
in a paradigm, as (43c) and (43e) and (43d) and (43f) show. 

Parallel to verbs, nouns also have an inflectional paradigm: 


(44) a. der Mann 
the.NoM man 


b. des Mannes 


the.GEN man.GEN 


c. dem Mann 
the.DAT man 


d. den Mann 
the.acc man 


e. die Manner 
the.NoM men 


f. der Manner 
the.GEN men 


g. den Männern 
the.DAT men.DAT 


h. die Männer 
the.acc men 


We can differentiate between nouns on the basis of gender (feminine, masculine, neuter). 
The choice of gender is often purely formal in nature and is only partially influenced by 
biological sex or the fact that we are describing a particular object: 


(45) a. die Tüte 
the.r bag(F) 
“the bag’ 


b. der Krampf 
the.m cramp(M) 


‘cramp’ 


c. das Kind 
the.N child(N) 


‘the child’ 


As well as gender, case (nominative, genitive, dative, accusative) and number are also 
important for nominal paradigms. 
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Like nouns, adjectives inflect for gender, case and number. They differ from nouns, 
however, in that gender marking is variable. Adjectives can be used with all three gen- 
ders: 


(46) a. eineschóne Blume 
a. beautiful.r flower 


b. einschöner Strauß 
a beautiful.m bunch 


c. einschönes Bouquet 
a beautiful.N bouquet 


In addition to gender, case and number, we can identify several inflectional classes. Tra- 
ditionally, we distinguish between strong, mixed and weak inflection of adjectives. The 
inflectional class that we have to choose is dependent on the form or presence of the 
article: 


(47) a. ein alter Wein 
an old wine 


b. der alte Wein 
the old wine 


c. alter Wein 
old wine 


Furthermore, adjectives have comparative and superlative wordforms: 


(48) a. klug 


clever 


b. klüg-er 


clever-er 


c. am  klüg-sten 
at.the clever-est 


This is not always the case. Especially for adjectives which make reference to some end 
point, a degree of comparison does not make sense. If a particular solution is optimal, 
for example, then no better one exists. Therefore, it does not make sense to speak of a 
“more optimal” solution. In a similar vein, it is not possible to be “deader” than dead. 
There are some special cases such as color adjectives ending in -a in German lila ‘pur- 
ple’ and rosa ‘pink’. These inflect optionally (49a), and the uninflected form is also pos- 


sible: 


(49) a. einelilan-e Blume 
a ` purple-F flower 


b. eine lila Blume 
a purple flower 
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In both cases, lila is classed an adjective. We can motivate this classification by appealing 
to the fact that both words occur at the same positions as other adjectives that clearly 
behave like adjectives with regard to inflection. 

The parts of speech discussed thus far can all be differentiated in terms of their inflec- 
tional properties. For words which do not inflect, we have to use additional criteria. For 
example, we can classify words by the syntactic context in which they occur (as we did 
for the non-inflecting adjectives above). We can identify prepositions, adverbs, conjunc- 
tions, interjections and sometimes also particles. Prepositions are words which occur 
with a noun phrase whose case they determine: 


(50) a. indiesen Raum 
in this.acc room 


b. indiesem Raum 
in this.DAT room 


wegen because’ is often classed as a preposition although it can also occur after the noun 
and in these cases would technically be a postposition: 


(51) des Geldes wegen 
the money.GEN because 


‘because of the money’ 


It is also possible to speak of adpositions if one wishes to remain neutral about the exact 
position of the word. 
Unlike prepositions, adverbs do not require a noun phrase. 


(52) a. Er schlaft in diesem Raum. 
he sleeps inthis room 


b. Er schlaft dort. 
he sleeps there 


Sometimes adverbs are simply treated as a special variant of prepositions (see page 94). 
The explanation for this is that a prepositional phrase such as in diesem Raum ‘in this 
room’ shows the same syntactic distribution as the corresponding adverbs. in differs 
from dort ‘there’ in that it needs an additional noun phrase. These differences are parallel 
to what we have seen with other parts of speech. For instance, the verb schlafen ‘sleep’ 
requires only a noun phrase, whereas erkennen ‘recognize’ requires two. 


(53) a. Er schläft. 
he sleeps 


b. Peter erkennt ihn. 
Peter recognizes him 


Conjunctions can be subdivided into subordinating and coordinating conjunctions. 
Coordinating conjunctions include und ‘and’ and oder ‘or’. In coordinate structures, two 
units with the same syntactic properties are combined. They occur adjacent to one an- 
other. dass ‘that’ and weil ‘because’ are subordinating conjunctions because the clauses 
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that they introduce can be part of a larger clause and depend on another element of this 
larger clause. 


(54) a. Klaus glaubt, dass er liigt. 
Klaus believes that he lies 


‘Klaus believes that he is lying’ 


b. Klaus glaubt ihm nicht, weil er ligt. 
Klaus believes him not because he lies 


‘Klaus doesn’t believe him because he is lying’ 


Interjections are clause-like expressions such as Ja! “Yes!’, Bitte! ‘Please!’ Hallo! 'Hel- 
lo", Hurra! "Hooray!’, Bravo! ‘Bravo!’, Pst! ‘Psst!’, Plumps! ‘Clonk!’. 

If adverbs and prepositions are not assigned to the same class, then adverbs are nor- 
mally used as a kind of “left over” category in the sense that all non-inflecting words 
which are neither prepositions, conjunctions nor interjections are classed as adverbs. 
Sometimes this category for “left overs” is subdivided: only words which can appear 
in front of the finite verb when used as a constituent are referred to as adverbs. Those 
words which cannot be fronted are dubbed particles. Particles themselves can be subdi- 
vided into various classes based on their function, e.g., degree particles and illocutionary 
particles. Since these functionally defined classes also contain adjectives, I will not make 
this distinction and simply speak of adverbs. 

We have already sorted a considerable number of inflectional words into word classes. 
When one is faced with the task of classifying a particular word, one can use the decision 
diagram in Figure 1.2 on the next page, which is taken from the Duden grammar of 
German (Eisenberg et al. 2005: 133). 

If a word inflects for tense, then it is a verb. If it displays different case forms, then 
one has to check if it has a fixed gender. If this is indeed the case, then we know that 
we are dealing with a noun. Words with variable gender have to be checked to see if 
they have comparative forms. A positive result will be a clear indication of an adjec- 
tive. All other words are placed into a residual category, which the Duden refers to as 
pronouns/article words. Like in the class of non-inflectional elements, the elements in 
this remnant category are subdivided according to their syntactic behavior. The Duden 
grammar makes a distinction between pronouns and article words. According to this 
classification, pronouns are words which can replace a noun phrase such as der Mann 
‘the man’, whereas article words normally combine with a noun. In Latin grammars, 
the notion of ‘pronoun’ includes both pronouns in the above sense and articles, since 
the forms with and without the noun are identical. Over the past centuries, the forms 
have undergone split development to the point where it is now common in contempo- 
rary Romance languages to distinguish between words which replace a noun phrase and 
those which must occur with a noun. Elements which belong to the latter class are also 
referred to as determiners. 


The Duden is the official document for the German orthography. The Duden grammar does not have an 
official status but is very influential and is used for educational purposes as well. I will refer to it several 
times in this introductory chapter. 
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part of speech 
ee 
inflects does not inflect 
Do E QC c 
for tense for case 
E di c 
fixed gender flexible gender 


uero Pe, 


nocomparative comparative 


verb noun article word adjective adverb 
pronoun conjunction 
preposition 
interjection 


Figure 1.2: Decision tree for determining parts of speech following Eisenberg et al. (2005: 
133) 


If we follow the decision tree in Figure 1.2, the personal pronouns ich T, du ‘you’, er 
‘he’, sie ‘her’, es ‘it’, wir ‘we’, ihr ‘you’, and sie ‘they’, for example, would be grouped 
together with the possessive pronouns mein ‘mine’, dein ‘your’, sein ‘his’/‘its’, ihr ‘her’/ 
‘their’, unser ‘our’, and euer ‘your’. The corresponding reflexive pronouns, mich ‘myself’, 
dich ‘yourself’, sich ‘himself’/‘herself’/‘itself’, ‘themselves’, uns ‘ourselves’, euch ‘your- 
self’, and the reciprocal pronoun einander ‘each other’ have to be viewed as a special 
case in German as there are no differing gender forms of sich ‘himself’ /‘herself’/‘itself 
and einander ‘each other’. Case is not expressed morphologically by reciprocal pronouns. 
By replacing genitive, dative and accusative pronouns with einander, it is possible to see 
that there must be variants of einander ‘each other’ in these cases, but these variants all 
share the same form: 


(55) a. Sie gedenken seiner / einander. 
they commemorate him.GEN  each.other 


b. Sie helfen ihm / einander. 
they help ` bim par each.other 


c. Sie lieben ihn / einander. 
they love him.acc each.other 


So-called pronominal adverbs such as darauf ‘on there’, darin ‘in there’, worauf ‘on 
where’, worin ‘in where’ also prove problematic. These forms consist of a preposition 
(e.g., auf ‘on’) and the elements da ‘there’ and wo ‘where’. As the name suggests, 
pronominal adverbs contain something pronominal and this can only be da ‘there’ and 


24 


14 Parts of speech 


wo ‘where’. However, da ‘there’ and wo ‘where’ do not inflect and would therefore, 
following the decision tree, not be classed as pronouns. 
The same is true of relative pronouns such as wo ‘where’ in (56): 


(56) a. Ichkomme eben aus der Stadt, wo ich Zeuge eines Unglücks gewesen 
I come PART fromthe city whereI witness of.an accident been 
bin.” 

am 

‘I come from the city where I was witness to an accident: 


b. Studien haben gezeigt, daf mehr Unfälle in Städten passieren, wo die 
studies have shown that more accidents in cities happen where the 


Zebrastreifen abgebaut werden, weil die Autofahrer unaufmerksam 
zebra.crossings removed become because the drivers unattentive 
werden.” 

become 


"Studies have shown that there are more accidents in cities where they do 
away with zebra crossings, because drivers become unattentive’ 


c. Zufällig war ich in dem Augenblick zugegen, wo der Steppenwolf 
coincidentally was I inthe moment present where the Steppenwolf 
zum erstenmalunser Haus betrat und bei meiner Tante sich einmietete.? 
to.the first.time our house enteredand by my aunt rEFLtook.lodgings 


‘Coincidentally, I was present at the exact moment in which Steppenwolf en- 
tered our house for the first time and took lodgings with my aunt: 


If they are uninflected, then they cannot belong to the class of pronouns according to 
the decision tree above. Eisenberg (2004: 277) notes that wo ‘where’ is a kind of unin- 
flected relative pronoun (he uses quotation marks) and remarks that this term runs con- 
trary to the exclusive use of the term pronoun for nominal, that is, inflected, elements. 
He therefore uses the term relative adverb for them (see also Eisenberg et al. (2005: §856, 
8857)). 

There are also usages of the relatives dessen ‘whose’ and wessen ‘whose’ in combina- 
tion with a noun: 


(57) a. der Mann, dessen Schwester ich kenne 
the man whose sister I know 


b. Ich möchte wissen, wessen Schwester du kennst. 
I wouldlike know whose sister you know 


‘I would like to know whose sister you know. 


According to the classification in the Duden, these should be covered by the terms Rel- 
ativartikelwort ‘relative article word’ and Interrogativartikelwort 'interrogative article 


"Drosdowski (1984: 672). 
taz berlin, 03.11.1997, p. 23. 
BHerman Hesse, Der Steppenwolf. Berlin und Weimar: Aufbau-Verlag. 1986, p. 6. 
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word’. They are mostly counted as part of the relative pronouns and question pronouns 
(see for instance Eisenberg (2004: 229)). Using Eisenberg’s terminology, this is unprob- 
lematic as he does not make a distinction between articles, pronouns and nouns, but 
rather assigns them all to the class of nouns. But authors who do make a distinction 
between articles and pronouns sometimes also speak of interrogative pronouns when 
discussing words which can function as articles or indeed replace an entire noun phrase. 

One should be prepared for the fact that the term pronoun is often simply used for 
words which refer to other entities and, this is important, not in the way that nouns 
such as book and John do, but rather dependent on context. The personal pronoun er 
‘he’ can, for example, refer to either a table or a man. This usage of the term pronoun 
runs contrary to the decision tree in Figure 1.2 and includes uninflected elements such 
as da ‘there’ and wo ‘where’. 

Expletive pronouns such as es ‘it’ and das ‘that’, as well as the sich ‘him’/‘her’/‘itself’ 
belonging to inherently reflexive verbs, do not make reference to actual objects. They 
are considered pronouns because of the similarity in form. Even if we were to assume a 
narrow definition of pronouns, we would still get the wrong results as expletive forms 
do not vary with regard to case, gender and number. If one does everything by the book, 
expletives would belong to the class of uninflected elements. If we assume that es ‘it’ as 
well as the personal pronouns have a nominative and accusative variant with the same 
form, then they would be placed in with the nominals. We would then have to admit 
that the assumption that es has gender would not make sense. That is we would have to 
count es as a noun by assuming neuter gender, analogous to personal pronouns. 

We have not yet discussed how we would deal with the italicized words in (58): 


(58) a. das geliebte Spielzeug 
the beloved toy 


b. das schlafende Kind 
the sleeping child 


c. die Frage des  Sprechens und Schreibens über Gefühle 
the question of.the talking and writing about feelings 
“the question of talking and writing about feelings’ 

d. Auf dem Europa-Parteitag fordern die Grünen einen ökosozialen 
on the Europe-party.conference demand the Greens a eco-social 
Politikwechsel. 
political.change 
‘At the European party conference, the Greens demanded eco-social political 
change! 

e. Maxlacht laut. 

Max laughs loudly 

f. Max würde wahrscheinlich lachen. 

Max would probably laugh 


geliebte ‘beloved’ and schlafende ‘sleeping’ are participle forms of lieben ‘to love’ and 
schlafen ‘to sleep’. These forms are traditionally treated as part of the verbal paradigm. 
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In this sense, geliebte and schlafende are verbs. This is referred to as lexical word class. 
The term lexeme is relevant in this case. All forms in a given inflectional paradigm belong 
to the relevant lexeme. In the classic sense, this term also includes the regularly derived 
forms. That is participle forms and nominalized infinitives also belong to a verbal lex- 
eme. Not all linguists share this view, however. Particularly problematic is the fact that 
we are mixing verbal with nominal and adjectival paradigms. For example, Sprechens 
'speaking.GEN' is in the genitive case and adjectival participles also inflect for case, num- 
ber and gender. Furthermore, it is unclear as to why schlafende ‘sleeping’ should be 
classed as a verbal lexeme and a noun such as Störung ‘disturbance’ is its own lexeme 
and does not belong to the lexeme stören ‘to disturb’. I subscribe to the more modern 
view of grammar and assume that processes in which a word class is changed result in 
a new lexeme being created. Consequently, schlafende ‘sleeping’ does not belong to the 
lexeme schlafen ‘to sleep’, but is a form of the lexeme schlafend. This lexeme belongs to 
the word class ‘adjective’ and inflects accordingly. 

As we have seen, it is still controversial as to where to draw the line between inflection 
and derivation (creation of a new lexeme). Sag, Wasow & Bender (2003: 263-264) view 
the formation of the present participle (standing) and the past participle (eaten) in English 
as derivation as these forms inflect for gender and number in French. 

Adjectives such as Griinen ‘the Greens’ in (58d) are nominalized adjectives and are 
written with a capital like other nouns in German when there is no other noun that can 
be inferred from the immediate context: 


(59) A: Willst du den roten Ball haben? 
want youthe red ball have 
"Do you want the red ball?’ 
B: Nein, gib mir bitte den grünen. 
no give me please the green 


‘No, give me the green one, please! 


In the answer to (59), the noun Ball has been omitted. This kind of omission is not 
present in (58d). One could also assume here that a word class change has taken place. 
If a word changes its class without combination with a visible affix, we refer to this as 
conversion. Conversion has been treated as a sub-case of derivation by some linguists. 
The problem is, however, that Griine ‘greens’ inflects just like an adjective and the gender 
varies depending on the object it is referring to: 


(60) a. Ein Grüner hat vorgeschlagen, ... 
a green.M has suggested 
‘A (male) member of the Green Party suggested ..? 


b. Eine Griine hat vorgeschlagen, ... 
a green.F has suggested 


‘A (female) member of the Green Party suggested ..? 


We also have the situation where a word has two properties. We can make life easier 
for ourselves by talking about nominalized adjectives. The lexical category of Griine is 
adjective and its syntactic category is noun. 
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The word in (58e) can inflect like an adjective and should therefore be classed as an 
adjective following our tests. Sometimes, these kinds of adjectives are also classed as 
adverbs. The reason for this is that the uninflected forms of these adjectives behave like 
adverbs: 


(61) Max lacht immer /oft / laut. 
Max laughs always often loud 


"Max (always/often) laughs (loudly): 


To capture this dual nature of words some researchers distinguish between lexical and 
syntactic category of words. The lexical category of laut ‘loud(ly)’ is that of an adjective 
and the syntactic category to which it belongs is ‘adverb’. The classification of adjectives 
such as laut ‘loud(ly)’ in (61) as adverbs is not assumed by all authors. Instead, some 
speak of adverbial usage of an adjective, that is, one assumes that the syntactic category 
is still adjective but it can be used in a different way so that it behaves like an adverb 
(see Eisenberg 2004: Section 7.3, for example). This is parallel to prepositions, which can 
occur in a variety of syntactic contexts: 


(62) a. Peter schlaftim Büro. 
Peter sleeps in.the office 
‘Peter sleeps in the office’ 


b. der Tischim Büro 
the table in.the office 


“the table in the office’ 
We have prepositional phrases in both examples in (62); however, in (62a) im Büro ‘in 
the office’ acts like an adverb in that it modifies the verb schläft ‘sleeps’ and in (62b) im 


Büro modifies the noun Tisch ‘table’. In the same way, laut ‘loud’ can modify a noun (63) 
or a verb (61). 


(63) die laute Musik 
the loud music 


1.5 Heads 


The head of a constituent/phrase is the element which determines the most important 
properties of the constituent/phrase. At the same time, the head also determines the 
composition of the phrase. That is, the head requires certain other elements to be present 
in the phrase. The heads in the following examples have been marked in italics: 


(64) a. Träumt dieser Mann? 
dreams this.NoM man 


‘Does this man dream?’ 
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b. Erwartet er diesen Mann? 
expects he.NoM this.Acc man 


‘Is he expecting this man?’ 


c. Hilft er diesem Mann? 
helps he.NoM this.DAT man 


‘Is he helping this man?’ 
d. in diesem Haus 
in this.DAT house 


e. ein Mann 
a.NOM man 


Verbs determine the case of their arguments (subjects and objects). In (64d), the preposi- 
tion determines which case the noun phrase diesem Haus 'this house' bears (dative) and 
also determines the semantic contribution of the phrase (it describes a location). (64e) 
is controversial: there are linguists who believe that the determiner is the head (Venne- 
mann & Harlow 1977; Hellan 1986; Abney 1987; Netter 1994; 1998) while others assume 
that the noun is the head of the phrase (Van Langendonck 1994; Pollard & Sag 1994: 49; 
Demske 2001; Müller 20072: Section 6.6.1; Hudson 2004; Bruening 2009). 

The combination of a head with another constituent is called a projection of the head. 
A projection which contains all the necessary parts to create a well-formed phrase of 
that type is a maximal projection. A sentence is the maximal projection of a finite verb. 

Figure 1.3 shows the structure of (65) in box representation. 


(65) DerMannliest einen Aufsatz. 
the man readsan essay 


"Ihe man is reading an essay: 


Unlike Figure 1.1, the boxes have been labelled here. 


NP 
Det N 
einen || Aufsatz 


Figure 1.3: Words and phrases in annotated boxes 


The annotation includes the category of the most important element in the box. VP 
stands for verb phrase and NP for noun phrase. VP and NP are maximal projections of 
their respective heads. 
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Anyone who has ever faced the hopeless task of trying to find particular photos of 
their sister’s wedding in a jumbled, unsorted cupboard can vouch for the fact that it is 
most definitely a good idea to mark the boxes based on their content and also mark the 
albums based on the kinds of photos they contain. 

An interesting point is that the exact content of the box with linguistic material does 
not play a role when the box is put into a larger box. It is possible, for example, to replace 
the noun phrase der Mann ‘the man’ with er ‘he’, or indeed the more complex der Mann 
aus Stuttgart, der das Seminar zur Entwicklung der Zebrafinken besucht ‘the man from 
Stuttgart who takes part in the seminar on the development of zebra finches’. However, 
it is not possible to use die Manner ‘the men’ or des Mannes ‘of the man’ in this position: 


(66) a. *Die Manner liest einen Aufsatz. 
the men readsan essay 


b. "Des Mannes liest einen Aufsatz. 
of.the man.GEN reads an essay 


The reason for this is that die Männer ‘the men’ is in plural and the verb liest ‘reads’ 
is in singular. The noun phrase bearing genitive case des Mannes can also not occur, 
only nouns in the nominative case. It is therefore important to mark all boxes with the 
information that is important for placing these boxes into larger boxes. Figure 1.4 shows 
our example with more detailed annotation. 


NP, nom, 3, sg NP, acc, 3, sg 


Det, nom, mas, sg || N, nom, mas, sg Det, acc, mas, sg || N, acc, mas, sg 
der Mann einen Aufsatz 


Figure 1.4: Words and word strings in annotated boxes 


The features ofa head which are relevant for determining in which contexts a phrase 
can occur are called head features. The features are said to be projected by the head. 


1.6 Arguments and adjuncts 


The constituents of a given clause have different relations to their head. It is typical to 
distinguish between arguments and adjuncts. The syntactic arguments of a head corre- 
spond for the most part to their logical arguments. We can represent the meaning of 
(67a) as (67b) using predicate logic. 
(67) a. Peter helps Maria. 

b. help’(peter’, maria’) 
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The logical representation of (67b) resembles what is expressed in (67a); however, it ab- 
stracts away from constituent order and inflection. Peter and Maria are syntactic argu- 
ments of the verb help and their respective meanings (Peter’ and Maria’) are arguments 
of the logical relation expressed by help’. One could also say that help assigns semantic 
roles to its arguments. Semantic roles include agent (the person carrying out an action), 
patient (the affected person or thing), beneficiary (the person who receives something) 
and experiencer (the person experiencing a psychological state). The subject of help is 
an agent and the direct object is a beneficiary. Arguments which fulfil a semantic role 
are also called actants. This term is also used for inanimate objects. 

This kind of relation between a head and its arguments is covered by the terms selection 
and valence. Valence is a term borrowed from chemistry. Atoms can combine with other 
atoms to form molecules with varying levels of stability. The way in which the electron 
shells are occupied plays an important role for this stability. If an atom combines with 
others atoms so that its electron shell is fully occupied, then this will lead to a stable 
connection. Valence tells us something about the number of hydrogen atoms which an 
atom of a certain element can be combined with. In forming H,O, oxygen has a valence 
of 2. We can divide elements into valence classes. Following Mendeleev, elements with 
a particular valence are listed in the same column in the periodic table. 

The concept of valence was applied to linguistics by Tesniére (1959): a head needs 
certain arguments in order to form a stable compound. Words with the same valence - 
that is which require the same number and type of arguments - are divided into valence 
classes. Figure 1.5 shows examples from chemistry as well as linguistics. 


O help 
ER BER 
HH Peter Mary 


Figure 1.5: Combination of hydrogen and oxygen and the combination of a verb with its 
arguments 


We used (67) to explain logical valence. Logical valence can, however, sometimes differ 
from syntactic valence. This is the case with verbs like rain, which require an expletive 
pronoun as an argument. Inherently reflexive verbs such as sich erholen ‘to recover’ in 
German are another example. 


(68) a. Esregnet. 
it rains 
‘It is raining? 
b. Klaus erholt sich. 
Klaus recovers REFL 
‘Klaus is recovering’ 


The expletive es ‘it’ with weather verbs and the sich of so-called inherent reflexives such 
as erholen ‘to recover’ have to be present in the sentence. Germanic languages have 
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expletive elements that are used to fill the position preceding the finite verb. These 
positional expletives are not realized in embedded clauses in German, since embedded 
clauses have a structure that differs from canonical unembedded declarative clauses, 
which have the finite verb in second position. (69a) shows that es cannot be omitted in 
dass-clauses. 


(69) a. *Ich glaube, dass regnet. 
I think that rains 


Intended: ‘I think that it is raining’ 


b. *Ich glaube, dass Klaus erholt. 
I believe that Klaus recovers 


Intended: ‘I believe that Klaus is recovering’ 
Neither the expletive nor the reflexive pronoun contributes anything semantically to the 
sentence. They must, however, be present to derive a complete, well-formed sentence. 
They therefore form part of the valence of the verb. 
Constituents which do not contribute to the central meaning of their head, but rather 


provide additional information are called adjuncts. An example is the adverb deeply in 
(70): 


(70) John loves Mary deeply. 


This says something about the intensity of the relation described by the verb. Further 
examples of adjuncts are attributive adjectives (71a) and relative clauses (71b): 


(71) a. a beautiful woman 


b. the man who Mary loves 
Adjuncts have the following syntactic/semantic properties: 


(72) a. Adjuncts do not fulfil a semantic role. 
b. Adjuncts are optional. 
c. Adjuncts can be iterated. 
The phrase in (71a) can be extended by adding another adjunct: 


(73) a beautiful clever woman 


If one puts processing problems aside for a moment, this kind of extension by adding 
adjectives could proceed infinitely (see the discussion of (38) on page 65). Arguments, 
on the other hand, cannot be realized more than once: 


(74) * The man the boy sleeps. 


If the entity carrying out the sleeping action has already been mentioned, then it is 
not possible to have another noun phrase which refers to a sleeping individual. If one 
wants to express the fact that more than one individual is sleeping, this must be done by 
means of coordination as in (75): 
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(75) The man and the boy are sleeping. 


One should note that the criteria for identifying adjuncts proposed in (72) is not suffi- 
cient, since there are also syntactic arguments that do not fill semantic roles (e.g., es ‘it’ 
in (68a) and sich (REFL) in (68b)) or are optional as pizza in (76). 


(76) Tony is eating (pizza). 


Heads normally determine the syntactic properties of their arguments in a relatively 
fixed way. A verb is responsible for the case which its arguments bear. 


(77) a. Ergedenkt des Opfers. 

he remembers the.GEN victim.GEN 
‘He remembers the victim: 

b. *Er gedenkt dem Opfer. 
he remembers the.DAT victim 

c. Erhilft dem Opfer. 
he helps the.pAT victim 
‘He helps the victim: 


d. * Er hilft des Opfers. 
he helps the.GEN victim.GEN 


The verb governs the case of its arguments. 
The preposition and the case of the noun phrase in the prepositional phrase are both 
determined by the verb: 


(78) a. Erdenkt an seine Modelleisenbahn. 

he thinks on his.Acc model.railway 
‘He is thinking of his model railway: 

b. 4 Er denkt an seiner Modelleisenbahn. 
He thinks on his.DAT model.railway 

c. Er hángtan seiner Modelleisenbahn. 
He hangs on his.DAT model.railway 
‘He clings to his model railway: 


d. *Er hängt an seine Modelleisenbahn. 
he hangs on his.acc model.railway 


The case of noun phrases in modifying prepositional phrases, on the other hand, depends 
on their meaning. In German, directional prepositional phrases normally require anoun 
phrase bearing accusative case (79a), whereas local PPs (denoting a fixed location) appear 
in the dative case (79b): 


“For similar examples, see Eisenberg (1994b: 78). 
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(79) a. Er geht in die Schule / auf den Weihnachtsmarkt / unter die 
he goes in the.acc school on the.acc Christmas.market under the.acc 
Brücke. 
bridge 
‘He is going to school/to the Christmas market/under the bridge’ 


b. Er schläft in der Schule / auf dem Weihnachtsmarkt / unter der 
he sleeps in the.pAT school on the.pAT Christmas.market under the.DAT 
Brücke. 
bridge 
‘He is sleeping at school/at the Christmas market/under the bridge’ 


An interesting case is the verb sich befinden ‘to be located’, which expresses the lo- 
cation of something. This cannot occur without some information about the location 
pertaining to the verb: 


(80) "Wir befinden uns. 
we are.located REFL 


The exact form of this information is not fixed - neither the syntactic category nor the 
preposition inside of prepositional phrases is restricted: 


(81) Wir befinden uns hier / unter der Brücke / neben dem Eingang / im Bett. 
we are REFLhere underthe bridge  nexttothe entrance in bed 


"We are here/under the bridge/next to the entrance/in bed: 


Local modifiers such as hier ‘here’ or unter der Brücke ‘under the bridge’ are analyzed 
with regard to other verbs (e.g., schlafen ‘sleep’) as adjuncts. For verbs such as sich 
befinden ‘to be (located), we will most likely have to assume that information about 
location forms an obligatory syntactic argument of the verb. 

The verb selects a phrase with information about location, but does not place any syn- 
tactic restrictions on its type. This specification of location behaves semantically like 
the other adjuncts we have seen previously. If I just consider the semantic aspects of 
the combination of a head and adjunct, then I also refer to the adjunct as a modifier.” 
Arguments specifying location with verbs such as sich befinden ‘to be located’ are also 
subsumed under the term modifier. Modifiers are normally adjuncts, and therefore op- 
tional, whereas in the case of sich befinden they seem to be (obligatory) arguments. 

In conclusion, we can say that constituents that are required to occur with a certain 
head are arguments of that head. Furthermore, constituents which fulfil a semantic role 
with regard to the head are also arguments. These kinds of arguments can, however, 
sometimes be optional. 


See Section 1.7.2 for more on the grammatical function of adverbials. The term adverbial is normally used 
in conjunction with verbs. modifier is a more general term, which normally includes attributive adjectives. 
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Arguments are normally divided into subjects and complements. Not all heads re- 
quire a subject (see Müller 2007a: Section 3.2). The number of arguments of a head can 
therefore also correspond to the number of complements of a head. 


17 Grammatical functions 


In some theories, grammatical functions such as subject and object form part of the for- 
mal description of language (see Chapter 7 on Lexical Functional Grammar, for example). 
This is not the case for the majority of the theories discussed here, but these terms are 
used for the informal description of certain phenomena. For this reason, I will briefly 
discuss them in what follows. 


17.1 Subjects 


Although I assume that the reader has a clear intuition about what a subject is, it is by 
no means a trivial matter to arrive at a definition of the word subject which can be used 
cross-linguistically. For German, Reis (1982) suggested the following syntactic properties 
as definitional for subjects: 


e agreement of the finite verb with it 

* nominative case in non-copular clauses 
e omitted in infinitival clauses (control) 

e optional in imperatives 


I have already discussed agreement in conjunction with the examples in (4). Reis (1982) 
argues that the second bullet point is a suitable criterion for German. She formulates a 
restriction to non-copular clause because there can be more than one nominative argu- 
ment in sentences with predicate nominals such as (82): 


(82) a. Er ist ein Lügner. 
he.nom ista  liar.NOM 
‘He is a liar. 


b. Er wurde ein Lügner genannt. 
he.NoM was a liar.nom called 


‘He was called a liar? 


Following this criterion, arguments in the dative case such as den Männern ‘the men’ 
cannot be classed as subjects in German: 


1671 some schools the term complement is understood to include the subject, that is, the term complement is 
equivalent to the term argument (see for instance Groß 2003: 342). Some researchers treat some subjects, 
e.g., those of finite verbs, as complements (Pollard 1996a; Eisenberg 1994a: 376). 
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(83) a. Erhilft den Männern. 
he helps the.DAT men.DAT 


‘He is helping the men! 


b. Den Männern wurde geholfen. 
the.DAT men.DAT were.3SG helped 


"Ihe men were helped: 


Following the other criteria, datives should also not be classed as subjects - as Reis (1982) 
has shown. In (83b), wurde, which is the 3rd person singular form, does not agree with 
den Männern. The third ofthe aforementioned criteria deals with infinitive constructions 
such as those in (84): 


(84) a. Klaus behauptet, den Männern zu helfen. 
Klaus claims the.DAT men.DAT to help 


‘Klaus claims to be helping the men! 


b. Klaus behauptet, dass er den Männern hilft. 
Klaus claims that he the.DAT men.pAT helps 


‘Klaus claims that he is helping the men! 

c. * Die Mànner behaupten, geholfen zu werden. 
the men claim helped to become 
Intended: “The men are claiming to be helped? 

d. * Die Männer behaupten, elegant getanzt zu werden. 
the men claim elegantly danced to become 


Intended: “The men claim that there is elegant dancing: 


In the first sentence, an argument of the verb helfen 'to help' has been omitted. If one 
wishes to express it, then one would have to use the subordinate clause beginning with 
dass ‘that’ as in (84b). Examples (84c,d) show that infinitives which do not require a 
nominative argument cannot be embedded under verbs such as behaupten ‘to claim’. 
If the dative noun phrase den Männern ‘the men’ were the subject in (83b), we would 
expect the control construction (84c) to be well-formed. This is, however, not the case. 
Instead of (84c), it is necessary to use (85): 


(85) Die Männer behaupten, dass ihnen geholfen wird. 
the men.nom claim that them.pAT helped becomes 
"Ihe men claim that they are being helped: 


In the same way, imperatives are not possible with verbs that do not require a nomina- 
tive. (86) shows some examples from Reis (1982: 186). 


(86) a. Fürchte dich nicht! 
be.scared REFL not 


‘Don’t be scared!’ 
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b. * Graue nicht! 
dread not 


‘Don’t dread it!’ 


c. Werd einmal unterstützt und ... 
be once supported and 


‘Let someone support you for once and ..: 


d. * Werd einmal geholfen und ... 
be once helped and 


‘Let someone help you and ..? 
The verb sich fürchten ‘to be scared’ in (86a) obligatorily requires a nominative argument 


as its subject (87a). The similar verb grauen ‘to dread’ in (86b) takes a dative argument 
(87b). 


(87) a. Ich fürchte mich vor Spinnen. 
LNoM be.scared REFL before spiders 
‘Tam scared of spiders: 


b. Mir graut vor Spinnen. 
me.DAT scares before spiders 


‘I am dreading spiders 
Interestingly, dative arguments in Icelandic behave differently. Zaenen et al. (1985) dis- 
cuss various characteristics of subjects in Icelandic and show that it makes sense to 
describe dative arguments as subjects in passive sentences even if the finite verb does 


not agree with them (Section 3.1) or they do not bear nominative case. An example of 
this is infinitive constructions with an omitted dative argument (p. 457): 


(88) a. Eg vonast til ad verða hjálpað. 
I hope forto be helped 
‘T hope that I will be helped: 


b. Að verahjálpaóí prófinu er óleyfilegt. 
to be helped on the.exam is not.allowed 


‘It is not allowed for one to be helped during the exam: 
In a number of grammars, clausal arguments such as those in (89) are classed as subjects 


as they can be replaced by a noun phrase in the nominative (90) (see e.g., Eisenberg 2004: 
63, 289). 


(89) a. Dasser schon um sieben kommen wollte, stimmt nicht. 
that he already at seven come wanted is.true not 
‘It’s not true that he wanted to come as soon as seven. 


b. Dasser Maria geheiratet hat, gefállt mir. 
that he Maria married has pleases me 


Tm glad that he married Maria! 
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(90) a. Das stimmt nicht. 
that is.true not 


“That isn’t true: 


b. Das gefallt mir. 
that pleases me 


‘I like that? 


It should be noted that there are different opinions on the question of whether clausal 
arguments should be treated as subjects or not. As recent publications show, there is 
still some discussion in Lexical Function Grammar (see Chapter 7) (Dalrymple & Lødrup 
2000; Berman 2003b; 2007; Alsina, Mohanan & Mohanan 2005; Forst 2006). 

If we can be clear about what we want to view as a subject, then the definition of object 
is no longer difficult: objects are all other arguments whose form is directly determined 
by a given head. As well as clausal objects, German has genitive, dative, accusative and 
prepositional objects: 


(91) a. Sie gedenken des Mannes. 
they remember the.GEN man.GEN 


‘They remember the man! 


b. Sie helfendem Mann. 
they help the.DAT man.DAT 


‘They are helping the man! 

c. Sie kennenden Mann. 
they know  the.Acc man.acc 
‘They know the man! 

d. Sie denken an den Mann. 
they think onthe man 


“They are thinking of the man! 


As well as defining objects by their case, it is commonplace to talk of direct objects and 
indirect objects. The direct object gets its name from the fact that - unlike the indirect 
object - the referent of a direct object is directly affected by the action denoted by the 
verb. With ditransitives such as the German geben ‘to give’, the accusative object is the 
direct object and the dative is the indirect object. 


(92) dass er dem Mann den Aufsatz gibt 
that he.NoM the.DAT man.paT the.Acc essay.Acc gives 
“that he gives the man the essay’ 
For trivalent verbs (verbs taking three arguments), we see that the verb can take either 


an object in the genitive case (93a) or, for verbs with a direct object in the accusative, a 
second accusative object (93b): 
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(93) a. dasser den Mann des Mordes bezichtigte 
that he the.acc man.acc the.GEN murder.GEN accused 


‘that he accused the man of murder’ 


b. dass er den Mann den Vers lehrte 
that he the.acc man.acc the.acc verse.Acc taught 


‘that he taught the man the verse’ 


These kinds of objects are sometimes also referred to as indirect objects. 

Normally, only those objects which are promoted to subject in passives with werden ‘to 
be’ are classed as direct objects. This is important for theories such as LFG (see Chapter 7) 
since passivization is defined with reference to grammatical function. With two-place 
verbal predicates, the dative is not normally classed as a direct object (Cook 2006). 


(94) dasser dem Mann hilft 
that he Oe par man.pDAT helps 


‘that he helps the man’ 


In many theories, grammatical function does not form a primitive component of the the- 
ory, but rather corresponds to positions in a tree structure. The direct object in German 
is therefore the object which is first combined with the verb in a configuration assumed 
to be the underlying structure of German sentences. The indirect object is the second 
object to be combined with the verb. On this view, the dative object of helfen ‘to help’ 
would have to be viewed as a direct object. 

In the following, I will simply refer to the case of objects and avoid using the terms 
direct object and indirect object. 

In the same way as with subjects, we consider whether there are object clauses which 
are equivalent to a certain case and can fill the respective grammatical function of a 
direct or indirect object. If we assume that dass du sprichst ‘that you are speaking’ in 
(95a) is a subject, then the subordinate clause must be a direct object in (95b): 


(95) a. Dass du sprichst, wird erwähnt. 
that you speak is mentioned 
"Ihe fact that you're speaking is being mentioned’ 


b. Er erwähnt, dass du sprichst. 
he mentions that you speak 


‘He mentions that you are speaking’ 


In this case, we cannot really view the subordinate clause as the accusative object since 
it does not bear case. However, we can replace the sentence with an accusative-marked 
noun phrase: 


(96) Er erwähnt diesen Sachverhalt. 
he mentions this.Acc matter 


‘He mentions this matter. 


If we want to avoid this discussion, we can simply call these arguments clausal objects. 
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1.7.2 The adverbial 


Adverbials differ semantically from subjects and objects. They tell us something about 
the conditions under which an action or process takes place, or the way in which a 
certain state persists. In the majority of cases, adverbials are adjuncts, but there are - as 
we have already seen - a number of heads which also require adverbials. Examples of 
these are verbs such as to be located or to make one’s way. For to be located, it is necessary 
to specify a location and for to proceed to a direction is needed. These kinds of adverbials 
are therefore regarded as arguments of the verb. 

The term adverbial comes from the fact that adverbials are often adverbs. This is 
not the only possibility, however. Adjectives, participles, prepositional phrases, noun 
phrases and even sentences can be adverbials: 


(97) a. Er arbeitet sorgfältig. 
he works carefully 


b. Er arbeitet vergleichend. 
he works comparatively 


‘He does comparative work! 


c. Er arbeitet in der Universitat. 
he works in the university 


‘He works at the university. 


d. Er arbeitet den ganzen Tag. 
he works the whole day.acc 


‘He works all day? 


e. Er arbeitet, weil es ihm Spaß macht. 
he works because it him.DAT fun makes 


‘He works because he enjoys it! 


Although the noun phrase in (97d) bears accusative case, it is not an accusative object. 
den ganzen Tag ‘the whole day’ is a so-called temporal accusative. The occurrence of 
accusative in this case has to do with the syntactic and semantic function of the noun 
phrase, it is not determined by the verb. These kinds of accusatives can occur with a 
variety of verbs, even with verbs that do not normally require an accusative object: 


(98) a. Er schlaft den ganzen Tag. 
he sleeps the whole day 


‘He sleeps the whole day. 


b. Er best den ganzen Tag diesen schwierigen Aufsatz. 
he reads the.acc whole.Acc day this.Acc difficult.Acc essay 


‘He spends the whole day reading this difficult essay’ 


c. Ergibt den Armen den ganzen Tag Suppe. 
he gives the.DAT poor.DArT the.acc whole.Acc day soup 


‘He spends the whole day giving soup to the poor? 
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The case of adverbials does not change under passivization: 


(99) 


a. 


b. 


weil den ganzen Tag gearbeitet wurde 
because the.acc whole.acc day worked was 


"because someone worked all day’ 


“weil der ganze Tag gearbeitet wurde 
because the.NoM whole.NoM day worked was 


1.7.3 Predicatives 


Adjectives like those in (100a,b) as well as noun phrases such as ein Lügner ‘a liar’ in 
(100c) are counted as predicatives. 


(100) 


a. 


Klaus ist klug. 
Klausis clever 

Er isst den Fisch roh. 
he eats the fish raw 


Er ist ein Lügner. 
heis a liar 


In the copula construction in (100a,c), the adjective klug ‘clever’ and the noun phrase 
ein Lügner 'a liar' is an argument of the copula sein 'to be' and the depictive adjective in 
(100b) is an adjunct to isst ‘eats’. 

For predicative noun phrases, case is not determined by the head but rather by some 
other element." For example, the accusative in (101a) becomes nominative under pas- 
sivization (101b): 


There is some dialectal variation with regard to copula constructions: in Standard German, the case of the 
noun phrase with sein ‘to be’ is always nominative and does not change when embedded under lassen ‘to 
let’. According to Drosdowski (1995: $ 1259), in Switzerland the accusative form is common which one finds 
in examples such as (ii.a). 


(i) 


(ii) 


a. 


Ich bin dein Tanzpartner. 
I am yourNoM dancing.partner 


. Der wüste Kerl ist ihr Komplize. 


the wild guy is her.NoM accomplice 


. Laß den wüsten Kerl[...] meinetwegen ihr Komplize sein. 


let the.Acc wild.Acc guy forallLcare her.NoM accomplice be 


"Let's assume that the wild guy is her accomplice, for all I care’ (Grebe & Gipper 1966: § 6925) 


. Baby, laß mich dein Tanzpartner sein. 


baby let me.acc your.Nom dancing.partner be 
‘Baby, let me be your dancing partner!’ (Funny van Dannen, Benno-Ohnesorg-Theater, Berlin, 
Volksbühne, 11.10.1995) 


Er lasst den lieben Gott ‘n frommen Mann sein. 
he lets the.acc dear.acc god a pious.acc man be 


‘He is completely lighthearted/unconcerned. 


. *Er lässt den lieben Gott ‘nfrommer Mann sein. 


he lets the.Acc dear.acc god a pious.NoM man be 
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(101) a. Sie nannte ihn einen Lügner. 
she called him.Acc a.Acc liar 


‘She called him a liar. 


b. Er wurde ein Lügner genannt. 
he.nom was  a.NoMliar called 


‘He was called a liar. 


Only ihn ‘him’ can be described as an object in (101a). In (101b), ihn becomes the subject 
and therefore bears nominative case. einen Lügner ‘a liar’ refers to ihn ‘him’ in (101a) 
and to er ‘he’ in (101b) and agrees in case with the noun over which it predicates. This is 
also referred to as agreement case. 

For other predicative constructions see Eisenberg et al. (2005: § 1206) and Müller 
(2002a: Chapter 4, Chapter 5) and Miiller (2008). 


1.7.4 Valence classes 


It is possible to divide verbs into subclasses depending on how many arguments they 
require and on the properties these arguments are required to have. The classic division 
describes all verbs which have an object which becomes the subject under passivization 
as transitive. Examples of this are verbs such as love or beat. Intransitive verbs, on the 
other hand, are verbs which have either no object, or one that does not become the 
subject in passive sentences. Examples of this type of verb are schlafen ‘to sleep’, helfen 
‘to help’, gedenken ‘to remember’. A subclass of transitive verbs are ditransitive verbs 
such as geben ‘to give’ and zeigen ‘to show’. 

Unfortunately, this terminology is not always used consistently. Sometimes, two-place 
verbs with dative and genitive objects are also classed as transitive verbs. In this naming 
tradition, the terms intransitive, transitive and ditransitive are synonymous with one- 
place, two-place and three-place verbs. 

The fact that this terminological confusion can lead to misunderstandings between 
even established linguistics is shown by Culicover and Jackendoff’s (2005: 59) criticism 
of Chomsky. Chomsky states that the combination of the English auxiliary be + verb 
with passive morphology can only be used for transitive verbs. Culicover and Jackendoff 
claim that this cannot be true because there are transitive verbs such as weigh and cost, 
which cannot undergo passivization: 


(102) a. This book weighs ten pounds / costs ten dollars. 
b. * Ten pounds are weighed / ten dollar are cost by this book. 
Culicover and Jackendoff use transitive in the sense of a verb requiring two arguments. 
If we only view those verbs whose object becomes the subject of a passive clause as 


transitive, then weigh and cost no longer count as transitive verbs and Culicover and 
Jackendoff's criticism no longer holds.? That noun phrases such as those in (102) are no 


18 Their cricitism also turns out to be unjust even if one views transitives as being two-place predicates. 
If one claims that a verb must take at least two arguments to be able to undergo passivization, one is not 
necessarily claiming that all verbs taking two or more arguments have to allow passivization. The property 
of taking multiple arguments is a condition which must be fulfilled, but it is by no means the only one. 
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ordinary objects can also be seen by the fact they cannot be replaced by pronouns. It is 
therefore not possible to ascertain which case they bear since case distinctions are only 
realized on pronouns in English. If we translate the English examples into German, we 
find accusative objects: 


(103) a. Das Buch kostete einen Dollar. 
the book costs one.acc dollar 
“The book costs one dollar’ 


b. Das Buch wiegt einen Zentner. 
the book weighs one.Acc centner 
"Ihe book weighs one centner? 

In the following, I will use transitive in the former sense, that is for verbs with an 
object that becomes the subject when passivized (e.g., with werden in German). When 
I talk about the class of verbs that includes helfen 'to help', which takes a nominative 
and dative argument, and schlagen ‘to hit’, which takes a nominative and accusative 
argument, I will use the term two-place or bivalent verb. 


1.8 A topological model of the German clause 


In this section, I introduce the concept of so-called topological fields (topologische Felder). 
These will be used frequently in later chapters to discuss different parts of the German 
clause. One can find further, more detailed introductions to topology in Reis (1980), 
Höhle (1986) and Askedal (1986). Wöllstein (2010) is a textbook about the topological 
field model. 


1.8.1 The position of the verb 
It is common practice to divide German sentences into three types pertaining to the 
position of the finite verb: 
e verb-final clauses 
e verb-first (initial) clauses 
e verb-second (V2) clauses 
The following examples illustrate these possibilities: 


(104) a. (Peter hat erzählt,) dass er das Eis gegessen hat. 
Peter has told that he the ice.cream eaten has 
‘Peter said that he has eaten the ice cream. 


b. Hat Peter das Eis gegessen? 
has Peter the ice.cream eaten 


‘Has Peter eaten the ice cream?’ 


c. Peter hat das Eis gegessen. 
Peter has the ice.cream eaten 


‘Peter has eaten the ice cream: 
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1.8.2 The sentence bracket, prefield, middle field and postfield 


We observe that the finite verb hat ‘has’ is only adjacent to its complement gegessen 
‘eaten’ in (104a). In (104b) and (104c), the verb and its complement are separated, that is, 
discontinuous. We can then divide the German clause into various sub-parts on the ba- 
sis of these distinctions. In (104b) and (104c), the verb and the auxiliary form a “bracket” 
around the clause. For this reason, we call this the sentence bracket (Satzklammer). The 
finite verbs in (104b) and (104c) form the left bracket and the non-finite verbs form the 
right bracket. Clauses with verb-final order are usually introduced by conjunctions such 
as weil ‘because’, dass ‘that’ and ob ‘whether’. These conjunctions occupy the same po- 
sition as the finite verb in verb-initial or verb-second clauses. We therefore also assume 
that these conjunctions form the left bracket in these cases. Using the notion of the sen- 
tence bracket, it is possible to divide the structure of the German clause into the prefield 
(Vorfeld), middle field (Mittelfeld) and postfield (Nachfeld). The prefield describes every- 
thing preceding the left sentence bracket, the middle field is the section between the left 
and right bracket and the postfield describes the position after the right bracket. The Ta- 
bles 1.1 and 1.2 give some examples of this. The right bracket can contain multiple verbs 
and is often referred to as a verbal complex or verb cluster. The assignment of question 
words and relative pronouns to the prefield will be discussed in the following section. 


Table 1.1: Examples of how topological fields can be occupied in declarative main clauses 


Prefield Left b. Middle field Rightb. Postfield 


Karl schläft. 
Kar ` sleeps 


Karl hat geschlafen. 
Karl has slept 


Karl erkennt Maria. 
Karl recognizes Maria 


Karl färbt den Mantel um den Maria kennt. 

Karl dies the coat PARTICLE that Maria knows 

Karl hat Maria erkannt. 

Karl has Maria recognized 

Karl hat Maria als sie aus dem Bus stieg sofort erkannt. 

Karl has Maria when she got.off the bus immediately recognized 

Karl hat Maria sofort erkannt als sie aus dem Bus stieg. 
Karl has Maria immediately recognized when she got.off the bus 
Karl hat Maria zu erkennen behauptet. 

Karl has Maria to recognize claimed 

Karl hat behauptet Maria zu erkennen. 

Karl has claimed Maria to know 
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Table 1.2: Examples of how topological fields can be occupied in yes/no questions, im- 
peratives, exclamatives and various verb final sentences including adverbial 
clauses, interrogative and relative clauses 


Prefield Left b. Middle field Right b. Postfield 


Schlaft Karl? 
sleeps Karl 


Schlaf! 
sleep 
Iss jetzt deinen Kuchen auf! 
eat now your cake up 
Hat er doch den ganzen Kuchen alleine gegessen! 
has he after all the whole cake alone eaten 
weil er den ganzen Kuchen alleine gegessen hat ohne es zu bereuen 
because he the whole cake alone eaten has without it to regret 
weil er den ganzen Kuchen alleine essen können will ohne gestört zu werden 
because he the whole cake eat can wants.to without distrurbed to be 
wer den ganzen Kuchen alleine gegessen hat 
who the whole cake alone eaten has 
der den ganzen Kuchen alleine gegessen hat 
who.REL the whole cake alone eaten has 
mit wem du geredet hast 
with whom you spoken have 
mit dem du geredet hast 
with whom.REL you spoken have 


1.8.3 Assigning elements to fields 


As the examples in the Tables 1.1 and 1.2 show, it is not required that all fields are always 
occupied. Even the left bracket can be empty if one opts to leave out the copula sein ‘to 
be’ such as in the examples in (105): 


(105) a. [...] egal, was noch passiert, der Norddeutsche Rundfunk 
regardless what still happens the north.German broadcasting.company 
steht schon jetzt als Gewinner fest.” 
stands already now as winner PART 


"Regardless of what still may happen, the North German broadcasting com- 
pany is already the winner. 


PSpiegel, 12/1999, p. 258. 
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b. Interessant, zu erwähnen, daß ihre Seele völlig in Ordnung war. 


20 
interesting to mention thather soul completely inorder was 


‘Tt is interesting to note that her soul was entirely fine! 


. Ein Treppenwitzder Musikgeschichte, daß die Kollegen von Rammstein 


an afterwit of.the history.of.music that the colleagues of Rammstein 


vor fünfJahrennoch im Vorprogramm von Sandow spielten.” 


before five years still in.the pre.programme of Sandow played 


“One of the little ironies of music history is that five years ago their colleagues 
of Rammstein were still an opening act for Sandow. 


The examples in (105) correspond to those with the copula in (106): 


(106) 


a. Egal ist, was noch passiert, ... 


regardless is what still happens 


“It is not important what still may happen ..? 


. Interessant ist zu erwáhnen, dass ihre Seele vóllig in Ordnung war. 


interesting is to mention that her soul completely in order was 


‘Tt is interesting to note that her soul was completely fine’ 


. Ein Treppenwitz der Musikgeschichte ist, dass die Kollegen von 


an afterwit ofthe music.history is that the colleagues of 
Rammstein vor fünf Jahren noch im Vorprogramm von Sandow spielten. 
Rammstein before five years still in pre.programme of Sandow played 


‘It is one of the little ironies of music history that five years ago their col- 
leagues of Rammstein were still an opening act for Sandow’ 


When fields are empty, it is sometimes not clear which fields are occupied by certain 
constituents. For the examples in (105), one would have to insert the copula to be able 
to ascertain that a single constituent is in the prefield and, furthermore, which fields are 
occupied by the other constituents. 


In the following example taken from Paul (1919: 13), inserting the copula obtains a 


different result: 


(107) 


a. Niemand da? 


nobody there 


. Ist niemand da? 


is nobody there 
‘Is nobody there?’ 


Here we are dealing with a question and niemand ‘nobody’ in (107a) should therefore 
not be analyzed as in the prefield but rather the middle field. 


20 Michail Bulgakow, Der Meister und Margarita. München: Deutscher Taschenbuch Verlag. 1997, p. 422. 
“I Flüstern & Schweigen, taz, 12.07.1999, p. 14. 
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In (108), there are elements in the prefield, the left bracket and the middle field. The 
right bracket is empty.” 


(108) Er gibt der Frau das Buch, die er kennt. 
he.m gives the woman(F) the book.(n) that.r he knows 


‘He gives the book to the woman that he knows: 


How should we analyze relative clauses such as die er kennt ‘that he knows’? Do they 
form part of the middle field or the postfield? This can be tested using a test developed 
by Bech (1955: 72) (Rangprobe): first, we modify the example in (108) so that it is in 
the perfect. Since non-finite verb forms occupy the right bracket, we can clearly see 
the border between the middle field and postfield. The examples in (109) show that the 
relative clause cannot occur in the middle field unless it is part of a complex constituent 
with the head noun Frau ‘woman’. 


(109) a. Er hat [der Frau] das Buch gegeben, [die er kennt]. 
he has the woman the book given that he knows 
‘He has given the book to the woman that he knows: 
b. *Er hat [der Frau] das Buch, [die er kennt,] gegeben. 
he has the woman the book that he knows given 


c. Er hat [der Frau, die er kennt,] das Buch gegeben. 
he has the woman that he knows the book given 


This test does not help if the relative clause is realized together with its head noun at the 
end of the sentence as in (110): 


(110) Er gibt das Buch der Frau, die er kennt. 
he gives the book the woman that he knows 


‘He gives the book to the woman that he knows: 


If we put the example in (110) in the perfect, then we observe that the lexical verb can 
occur before or after the relative clause: 


(111) a. Er hat das Buch [der Frau] gegeben, [die er kennt]. 
he has the book the woman given that he knows 
‘He has given the book to the woman he knows: 


b. Er hat das Buch [der Frau, die er kennt,] gegeben. 
he has the book the woman that he knows given 


In (111a), the relative clause has been extraposed. In (111b) it forms part of the noun phrase 
der Frau, die er kennt ‘the woman that he knows’ and therefore occurs inside the NP in 
the middle field. It is therefore not possible to rely on this test for (110). We assume that 
the relative clause in (110) also belongs to the NP since this is the most simple structure. If 


22 : . ; ; : 
The sentence requires emphasis on der ‘the’. der Frau, die er kennt ‘the woman’ is contrasted with another 
woman or other women. 
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the relative clause were in the postfield, we would have to assume that it has undergone 
extraposition from its position inside the NP. That is, we would have to assume the NP- 
structure anyway and then extraposition in addition. 

We have a similar problem with interrogative and relative pronouns. Depending on 
the author, these are assumed to be in the left bracket (Kathol 2001; Dürscheid 2003: 
94-95; Eisenberg 2004: 403; Pafel 2011: 57) or the prefield (Eisenberg et al. 2005: §1345; 
Wöllstein 2010: 29-30, Section 3.1) or even in the middle field (Altmann & Hofman 2004: 
75). In Standard German interrogative or relative clauses, both fields are never simulta- 
neously occupied. For this reason, it is not immediately clear to which field an element 
belongs. Nevertheless, we can draw parallels to main clauses: the pronouns in interrog- 
ative and relative clauses can be contained inside complex phrases: 


(112) a. der Mann, [mit dem] du gesprochen hast 
the man with whom you spoken have 


‘the man you spoke to’ 


b. Ich möchte wissen, [mit wem] du gesprochen hast. 
I wanttoknow with whom you spoken have 


‘I want to know who you spoke to’ 


Normally, only individual words (conjunctions or verbs) can occupy the left bracket,” 
whereas words and phrases can appear in the prefield. It therefore makes sense to assume 
that interrogative and relative pronouns (and phrases containing them) also occur in this 
position. 

Furthermore, it can be observed that the dependency between the elements in the 
Vorfeld of declarative clauses and the remaining sentence is of the same kind as the 
dependency between the phrase that contains the relative pronoun and the remaining 
sentence. For instance, über dieses Thema ‘about this topic’ in (113a) depends on Vortrag 
‘talk’, which is deeply embedded in the sentence: einen Vortrag ‘a talk’ is an argument 
of zu halten ‘to hold’, which in turn is an argument of gebeten ‘asked’. 


(113) a. Uber dieses Thema habe ich ihn gebeten, einen Vortrag zu halten. 
about this topic havel himasked a talk to hold 
‘I asked him to give a talk about this topic. 


b. das Thema, über das ichihn gebeten habe, einen Vortrag zu halten 
the topic about whichI himasked have a talk to hold 


‘the topic about which I asked him to give a talk’ 
The situation is similar in (113b): the relative phrase über das ‘about which’ is a dependent 


of Vortrag ‘talk’ which is realized far away from it. Thus, if the relative phrase is assigned 
to the Vorfeld, it is possible to say that such nonlocal frontings always target the Vorfeld. 


P Coordination is an exception to this: 


(i) Er [kennt und liebt] diese Schallplatte. 
he knows and loves this record 


‘He knows and loves this record. 
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Finally, the Duden grammar (Eisenberg et al. 2005: §1347) provides the following ex- 
amples from non-standard German (mainly southern dialects): 


(114) a. Kommt drauf an, mit wem dass sie zu tun haben. 
comes there.upon PART with whom that you to do have 


‘It depends on whom you are dealing with’ 


(115) a. Lotti, die wo eine tolle Sekretärin ist, hat ein paar merkwürdige 
Lotti who where a great secretary is hasa few strange 
Herren empfangen. 
gentlemen welcomed 


‘Lotti, who is a great secretary, welcomed a few strange gentlemen’ 


b. Du bist der beste Sänger, den wo ich kenn. 
you are the best singer who whereI know 


"You are the best singer whom I know. 


These examples of interrogative and relative clauses show that the left sentence bracket 
is filled with a conjunction (dass ‘that’ or wo ‘where’ in the respective dialects). So if one 
wants to have a model that treats Standard German and the dialectal forms uniformly, 
it is reasonable to assume that the relative phrases and interrogative phrases are located 


in the Vorfeld. 


1.8.4 Recursion 


As already noted by Reis (1980: 82), when occupied by a complex constituent, the prefield 
can be subdivided into further fields including a postfield, for example. The constituents 
für lange lange Zeit ‘for a long, long time’ in (116b) and daß du kommst ‘that you are com- 
ing’ in (116d) are inside the prefield but occur to the right of the right bracket verschiittet 
‘buried’ / gewußt ‘knew’, that is they are in the postfield of the prefield. 


(116) a. Die Möglichkeit, etwas zu verändern, ist damit verschüttet für lange 
the possibility somethingto change is there.with buried for long 
lange Zeit. 
long time 


"Ihe possibility to change something will now be gone for a long, long time! 
b. [Verschüttet für lange lange Zeit] ist damit die Möglichkeit, etwas 
buried for long long time ist there.with the possibility something 
zu verändern. 
to change 
c. Wir haben schon seit langem gewußt, daß du kommst. 
we have PART since Jong known that you come 
"We have known for a while that you are coming: 
d. [Gewußt, daß du kommst,] haben wir schon seit langem. 
known that you come have we PART since long 
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Like constituents in the prefield, elements in the middle field and postfield can also have 
an internal structure and be divided into subfields accordingly. For example, daß ‘that’ 
is the left bracket of the subordinate clause daß du kommst in (116c), whereas du ‘you’ 
occupies the middle field and kommst ‘come’ the right bracket. 


Comprehension questions 


1. How does the head of a phrase differ from non-heads? 


2. What is the head in the examples in (117)? 


(117) a. he 
b. Go! 
c. quick 
3. How do arguments differ from adjuncts? 


4. Identify the heads, arguments and adjuncts in the following sentence (118) and 
in the subparts of the sentence: 


(118) Er hilft den kleinen Kindern in der Schule. 
he helps the small children in the school 


‘He helps small children at school. 


5. How can we define the terms prefield (Vorfeld), middle field (Mittelfeld), post- 
field (Nachfeld) and the left and right sentence brackets (Satzklammer)? 
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Exercises 


1.8 A topological model of the German clause 


1. Identify the sentence brackets, prefield, middle field and postfield in the fol- 
lowing sentences. Do the same for the embedded clauses! 


(119) 


a. 


Karl isst. 
Karl eats 


‘Karl is eating’ 
Der Mann liebt eine Frau, den Peter kennt. 
the man lovesa woman who Peter knows 


“The man who Peter knows loves a woman. 


Der Mann liebt eine Frau, die Peter kennt. 
the man lovesa woman that Peter knows 


“The man loves a woman who Peter knows. 


Die Studenten haben behauptet, nur wegen derHitze 
the students have claimed only because.of the heat 
einzuschlafen. 

to.fall.asleep 


“The students claimed that they were only falling asleep because of 
the heat. 


Dass Peter nicht kommt, ärgert Klaus. 
that Peternot comes annoys Klaus 


‘(The fact) that Peter isn't coming annoys Klaus: 


Einen Mann küssen, der ihr nicht gefällt, würde sie nie. 
a man kiss that her not pleases would she never 


‘She would never kiss a man she doesn't like? 
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Further reading 


Reis (1980) gives reasons for why field theory is important for the description of 
the position of constituents in German. 

Höhle (1986) discusses fields to the left of the prefield, which are needed for 
left-dislocation structures such as with der Mittwoch in (120), aber in (121a) and 
denn in (121b): 


(120) Der Mittwoch, der passt mir gut. 
the Wednesday that fits me good 
“Wednesday, that suits me fine! 


(121) a. Aber würde denn jemand den Hund füttern morgen Abend? 
but would part anybody the dog feed tomorrow evening 


‘But would anyone feed the dog tomorrow evening?’ 


b. Denn dass es regnet, damit rechnet keiner. 
because that it rains there.with reckons nobody 


‘Because no-one expects that it will rain’ 


Höhle also discusses the historical development of field theory. 


2 Phrase structure grammar 


This chapter deals with phase structure grammars (PSGs), which play an important role 
in several of the theories we will encounter in later chapters. 


2.1 Symbols and rewrite rules 


Words can be assigned to a particular part of speech on the basis of their inflectional 
properties and syntactic distribution. Thus, weil ‘because’ in (1) is a conjunction, whereas 
das ‘the’ and dem ‘the’ are articles and therefore classed as determiners. Furthermore, 
Buch ‘book’ and Mann ‘man’ are nouns and gibt ‘gives’ is a verb. 


(1) weil er das Buch dem Mann gibt 
because he the book the man gives 


‘because he gives the man the book’ 


Using the constituency tests we introduced in Section 1.3, we can show that individual 
words as well as the strings das Buch ‘the book’ and dem Mann ‘the man’, form con- 
stituents. These get then assigned certain symbols. Since nouns form an important part 
of the phrases das Buch and dem Mann, these are referred to as noun phrases or NPs, for 
short. The pronoun er ‘he’ can occur in the same positions as full NPs and can therefore 
also be assigned to the category NP. 

Phrase structure grammars come with rules specifying which symbols are assigned 
to certain kinds of words and how these are combined to create more complex units. A 
simple phrase structure grammar which can be used to analyze (1) is given in (2):"? 


(2) NP—DetN NP >er N — Buch 
S —NPNPNPV Det — das N — Mann 
Det — dem V > gibt 


We can therefore interpret a rule such as NP — Det N as meaning that a noun phrase, 
that is, something which is assigned the symbol NP, can consist of a determiner (Det) 
and a noun (N). 


! ignore the conjunction weil ‘because’ for now. Since the exact analysis of German verb-first and verb- 
second clauses requires a number of additional assumptions, we will restrict ourselves to verb-final clauses 
in this chapter. 

[he rule NP er may seem odd. We could assume the rule PersPron — er instead but then would have 
to posit a further rule which would specify that personal pronouns can replace full NPs: NP — PersPron. 
The rule in (2) combines the two aforementioned rules and states that er ‘he’ can occur in positions where 
noun phrases can. 


2 Phrase structure grammar 


We can analyze the sentence in (1) using the grammar in (2) in the following way: 
first, we take the first word in the sentence and check if there is a rule in which this 
word occurs on the right-hand side of the rule. If this is the case, then we replace the 
word with the symbol on the left-hand side of the rule. This happens in lines 2-4, 6-7 
and 9 of the derivation in (3). For instance, in line 2 er is replaced by NP. If there are two 
or more symbols which occur together on the right-hand side of a rule, then all these 
words are replaced with the symbol on the left. This happens in lines 5, 8 and 10. For 
instance, in line 5 and 8, Det and N are rewritten as NP. 


(3) words and symbols rules that are applied 

1] er das Buch dem Mann gibt 

2| NP das Buch dem Mann gibt NP > er 

3|NP Det Buch dem Mann gibt Det — das 
4|NP Det N dem Mann gibt N — Buch 

5 | NP NP dem Mann gibt NP — Det N 

6 | NP NP Det Mann gibt Det — dem 

7 | NP NP Det N gibt N — Mann 

8 | NP NP NP gibt NP — Det N 

9 | NP NP NP V V — gibt 
10 S S — NP NP NP V 


In (3), we began with a string of words and it was shown that we can derive the structure 
of a sentence by applying the rules of a given phrase structure grammar. We could have 
applied the same steps in reverse order: starting with the sentence symbol S, we would 
have applied the steps 9-1 and arrived at the string of words. Selecting different rules 
from the grammar for rewriting symbols, we could use the grammar in (2) to get from 
S to the string er dem Mann das Buch gibt ‘he the man the book gives’. We can say that 
this grammar licenses (or generates) a set of sentences. 

The derivation in (3) can also be represented as a tree. This is shown by Figure 2.1. The 


NP NP NP V 


P S 


De N Det N 


er das Buch dem Mann gibt 
he the book the man gives 


Figure 2.1: Analysis of er das Buch dem Mann gibt ‘he the book the man gives’ 
symbols in the tree are called nodes. We say that S immediately dominates the NP nodes 


and the V node. The other nodes in the tree are also dominated, but not immediately 
dominated, by S. If we want to talk about the relationship between nodes, it is common 
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to use kinship terms. In Figure 2.1, S is the mother node of the three NP nodes and the 
V node. The NP node and V are sisters since they have the same mother node. If a node 
has two daughters, then we have a binary branching structure. If there is exactly one 
daughter, then we have a unary branching structure. Two constituents are said to be 
adjacent if they are directly next to each other. 

Phrase structure rules are often omitted in linguistic publications. Instead, authors opt 
for tree diagrams or the compact equivalent bracket notation such as (4). 


(4) [s [np er] [np [pet das] [n Buch] [np [pet dem] [n Mann]] [y gibt] 
he the book the man gives 


Nevertheless, it is the grammatical rules which are actually important since these rep- 
resent grammatical knowledge which is independent of specific structures. In this way, 
we can use the grammar in (2) to parse or generate the sentence in (5), which differs 
from (1) in the order of objects: 


(5 [weil] er dem Mann das Buch gibt 
because he.nom he par man  the.Acc book gives 


‘because he gives the man the book’ 


The rules for replacing determiners and nouns are simply applied in a different order 
than in (1). Rather than replacing the first Det with das 'the' and the first noun with 
Buch ‘book’, the first Det is replaced with dem ‘the’ and the first noun with Mann. 

At this juncture, I should point out that the grammar in (2) is not the only possible 
grammar for the example sentence in (1). There is an infinite number of possible gram- 
mars which could be used to analyze these kinds of sentences (see exercise 1). Another 
possible grammar is given in (6): 


(69 NP Det N NP >er N — Buch 
V —NPV Det — das N — Mann 
Det — dem V > gibt 


This grammar licenses binary branching structures as shown in Figure 2.2 on the follow- 
ing page. 

Both the grammar in (6) and (2) are too imprecise. If we adopt additional lexical entries 
for ich 'T and den ‘the’ (accusative) in our grammar, then we would incorrectly license 
the ungrammatical sentences in (7b-d):° 


3With the grammar in (6), we also have the additional problem that we cannot determine when an utterance 
is complete since the symbol V is used for all combinations of V and NP. Therefore, we can also analyze 
the sentence in (i) with this grammar: 


(i) a. "der Mann erwartet 
the man expects 
b. *des Mannes er das Buch dem Mann gibt 


the.GEN man.GEN he.NoM the.acc book the.DAT man gives 


The number of arguments required by a verb must be somehow represented in the grammar. In the fol- 
lowing chapters, we will see exactly how the selection of arguments by a verb (valence) can be captured 
in various grammatical theories. 
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V 
m uio 
NP V 
p m en 
NP V 
ZN ee 
Det N NP V 
ANK 
Det N 


er das Buch dem Mann gibt 
he the book the man gives 


Figure 2.2: Analysis of er das Buch dem Mann gibt with a binary branching structure 


(7) a. er das Buch dem Mann gibt 
he.NoM the.acc book the.DAT man gives 


‘He gives the book to the man! 


b. *ich das Buch dem Mann gibt 
LNoM the.acc book the.DAT man gives 


* 


c. "er das Buchden Mann gibt 
he.NoM the.Acc book the.acc man gives 


d. *er den Buch dem Mann gibt 
he.NoM the.m book(N) the man gives 


In (7b), subject-verb agreement has been violated, in other words: ich 'T and gibt ‘gives’ 
do not fit together. (7c) is ungrammatical because the case requirements of the verb have 
not been satisfied: gibt ‘gives’ requires a dative object. Finally, (7d) is ungrammatical 
because there is a lack of agreement between the determiner and the noun. It is not 
possible to combine den ‘the’, which is masculine and bears accusative case, and Buch 
‘book’ because Buch is neuter gender. For this reason, the gender properties of these two 
elements are not the same and the elements can therefore not be combined. 

In the following, we will consider how we would have to change our grammar to stop 
it from licensing the sentences in (7b-d). If we want to capture subject-verb agreement, 
then we have to cover the following six cases in German, as the verb has to agree with 
the subject in both person (1, 2, 3) and number (sg, pl): 


(8) a. Ich schlafe. (1, sg) 
I sleep 
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b. Du schläfst. (2, sg) 
you sleep 

c. Er schläft. (3, sg) 
he sleeps 

d. Wir schlafen. (1, pl) 
we sleep 

e. Ihr schlaft. (2, pl) 
you sleep 

f. Sie schlafen. (3, pl) 
they sleep 


It is possible to capture these relations with grammatical rules by increasing the number 
of symbols we use. Instead of the rule S— NP NP NP V, we can use the following: 


(9) S— NP 1sg NP NP V1 sg 
S 5 NP 2 sg NP NP V 2 sg 
S > NP 3 sg NP NP V 3 sg 
S— NP 1 pl NP NP V 1 pl 
S NP 2 pl NP NP V 2 pl 
S 5 NP 3 pl NP NP V 3 pl 


This would mean that we need six different symbols for noun phrases and verbs respec- 
tively, as well as six rules rather than one. 

In order to account for case assignment by the verb, we can incorporate case infor- 
mation into the symbols in an analogous way. We would then get rules such as the 
following: 


(10) S NP 1 sg nom NP dat NP acc V 1 sg nom dat acc 
S — NP 2 sg nom NP dat NP acc V 2 sg nom dat acc 
S — NP 3 sg nom NP dat NP acc V 3 sg nom dat acc 
S —5 NP 1 pl nom NP. dat NP acc V 1 pl nom dat acc 
S — NP 2 pl nom NP dat NP acc V 2 pl nom dat acc 
S — NP 3 pl nom NP dat NP acc V 3 pl nom dat acc 


Since it is necessary to differentiate between noun phrases in four cases, we have a total 
of six symbols for NPs in the nominative and three symbols for NPs with other cases. 
Since verbs have to match the NPs, that is, we have to differentiate between verbs which 
select three arguments and those selecting only one or two (11), we have to increase the 
number of symbols we assume for verbs. 


(1) a. Er schläft. 
he sleeps 
‘He is sleeping’ 
b. *Er schlaft das Buch. 
he sleeps the book 
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c. Erkennt das Buch. 
he knows the book 


‘He knows the book. 


d. * Er kennt. 
he knows 


In the rules above, the information about the number of arguments required by a verb 
is included in the marking ‘nom_dat_acc’. 

In order to capture the determiner-noun agreement in (12), we have to incorporate 
information about gender (fem, mas, neu), number (sg, pl), case (nom, gen, dat, acc) and 
the inflectional classes (strong, weak)’. 


(12) a. der Mann, die Frau, das Buch (gender) 
the.m man(m) the.F woman(F) the.n book(N) 
b. dasBuch, die Bücher (number) 
the book so the books pt. 
c. des Buches, dem Buch (case) 


the.GEN book.GEN the.DAT book 


d. einBeamter, der Beamte (inflectional class) 


a civil.servant the civil.servant 


Instead of the rule NP — Det N, we will have to use rules such as those in (13):° 


N 


(13) 


NP_3_sg_nom — Det 


fem_sg_nom 


fem_sg_nom 


NP_3_sg_nom — Det 


mas sg nom 


mas sg nom 


NP 3 sg nom Det 


neu sg nom 


neu sg nom 


NP 3 pl nom — Det 


fem pl nom 


fem pl nom 


NP 3 pl nom — Det 


mas pl nom 


mas pl nom 


NP 3 pl nom — Det 


neu pl nom 


22722272 


neu_pl_nom 


NP_3_sg_nom — Det 


fem_sg_nom 


fem_sg_nom 


NP_3_sg_nom — Det 


mas sg nom 


mas sg nom 


NP 3 sg nom Det 
NP 3 pl nom — Det 


neu sg nom 
fem pl nom 


neu sg nom 
fem pl nom 


NP 3 pl nom — Det 


mas pl nom 


mas pl nom 


NP 3 pl nom — Det 


neu pl nom 


2272227272 


neu_pl_nom 


(13) shows the rules for nominative noun phrases. We would need analogous rules for 
genitive, dative, and accusative. We would then require 24 symbols for determiners 
(3 * 2 * 4), 24 symbols for nouns and 24 rules rather than one. If inflection class is taken 
into account, the number of symbols and the number of rules doubles. 


‘These are inflectional classes for adjectives which are also relevant for some nouns such as Beamter ‘civil 
servant’, Verwandter ‘relative’, Gesandter ‘envoy’. For more on adjective classes see page 21. 
"to keep things simple, these rules do not incorporate information regarding the inflection class. 
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2.2 Expanding PSG with features 


Phrase structure grammars which only use atomic symbols are problematic as they can- 
not capture certain generalizations. We as linguists can recognize that NP_3_sg_nom 
stands for a noun phrase because it contains the letters NP. However, in formal terms 
this symbol is just like any other symbol in the grammar and we cannot capture the 
commonalities of all the symbols used for NPs. Furthermore, unstructured symbols do 
not capture the fact that the rules in (13) all have something in common. In formal terms, 
the only thing that the rules have in common is that there is one symbol on the left-hand 
side of the rule and two on the right. 

We can solve this problem by introducing features which are assigned to category 
symbols and therefore allow for the values of such features to be included in our rules. 
For example, we can assume the features person, number and case for the category sym- 
bol NP. For determiners and nouns, we would adopt an additional feature for gender 
and one for inflectional class. (14) shows two rules augmented by the respective values 
in brackets: 


(14) NP(@3,sg,nom) — Det(fem,sg,nom) N(fem,sg,nom) 
NP(3,sg,nom) — Det(mas,sg,nom) N(mas,sg,nom) 


If we were to use variables rather than the values in (14), we would get rule schemata as 
the one in (15): 


(15) NP(3,Num,Case) — Det(Gen,Num,Case) N(Gen,Num,Case) 


The values of the variables here are not important. What is important is that they match. 
For this to work, it is important that the values are ordered; that is, in the category of a 
determiner, the gender is always first, number second and so on. The value of the person 
feature (the first position in the NP(3,Num,Case)) is fixed at ‘3’ by the rule. These kind 
of restrictions on the values can, of course, be determined in the lexicon: 


(16) NP@,sgnom)  —es 
Det(mas,sg,nom) — des 


The rules in (10) can be collapsed into a single schema as in (17): 


(17 S — NP(Perl,Num1,nom) 
NP(Per2,Num2,dat) 
NP(Per3,Num3,acc) 
V(Per1,Num1,ditransitive) 


The identification of Perl and Numi on the verb and on the subject ensures that there 
is subject-verb agreement. For the other NPs, the values of these features are irrelevant. 
The case of these NPs is explicitly determined. 


°Chapter 6 introduces attribute value structures. In these structure we always have pairs of a feature name 
and a feature value. In such a setting, the order of values is not important, since every value is uniquely 
identified by the corresponding feature name. Since we do not have a feature name in schemata like (13), 
the order of the values is important. 
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2.3 Semantics 


In the introductory chapter and the previous sections, we have been dealing with syntac- 
tic aspects of language and the focus will remain very much on syntax for the remainder 
of this book. It is, however, important to remember that we use language to commu- 
nicate, that is, to transfer information about certain situations, topics or opinions. If 
we want to accurately explain our capacity for language, then we also have to explain 
the meanings that our utterances have. To this end, it is necessary to understand their 
syntactic structure, but this alone is not enough. Furthermore, theories of language ac- 
quisition that only concern themselves with the acquisition of syntactic constructions 
are also inadequate. The syntax-semantics interface is therefore important and every 
grammatical theory has to say something about how syntax and semantics interact. In 
the following, I will show how we can combine phrase structure rules with semantic 
information. To represent meanings, I will use first-order predicate logic and A-calculus. 
Unfortunately, it is not possible to provide a detailed discussion of the basics of logic 
so that even readers without prior knowledge can follow all the details, but the simple 
examples discussed here should be enough to provide some initial insights into how syn- 
tax and semantics interact and furthermore, how we can develop a linguistic theory to 
account for this. 

To show how the meaning of a sentence is derived from the meaning of its parts, we 
will consider (18a). We assign the meaning in (18b) to the sentence in (18a). 


(18) a. Max schläft. 

Max sleeps 

"Max is sleeping: 

b. schlafen' (max) 

Here, we are assuming schlafen’ to be the meaning of schläft ‘sleeps’. We use prime 
symbols to indicate that we are dealing with word meanings and not actual words. At 
first glance, it may not seem that we have really gained anything by using schlafen’ to 
represent the meaning of (18a), since it is just another form of the verb schläft ‘sleeps’. 


It is, however, important to concentrate on a single verb form as inflection is irrelevant 
when it comes to meaning. We can see this by comparing the examples in (19a) and (19b): 


(19) a. Jeder Junge schlaft. 
every boy sleeps 
‘Every boy sleeps: 


b. Alle Jungen schlafen. 
all boys sleep 


‘All boys sleep: 


To enhance readability I use English translations of the predicates in semantic represen- 
tations from now on.’ So the meaning of (18a) is represented as (20) rather then (18b): 


"Note that I do not claim that English is suited as representation language for semantic relations and con- 
cepts that can be expressed in other languages. 


60 


2.3 Semantics 


(20)  sleep’(max’) 


When looking at the meaning in (20), we can consider which part of the meaning comes 
from each word. It seems relatively intuitive that max’ comes from Max, but the trickier 
question is what exactly schläft ‘sleeps’ contributes in terms of meaning. If we think 
about what characterizes a ‘sleeping’ event, we know that there is typically an individual 
who is sleeping. This information is part of the meaning of the verb schlafen ‘to sleep’. 
The verb meaning does not contain information about the sleeping individual, however, 
as this verb can be used with various subjects: 


(21) a. Paul schläft. 
Paul sleeps 
‘Paul is sleeping: 
b. Mio schlaft. 
Mio sleeps 
“Mio is sleeping; 
c. Xaver schlaft. 
Xaver sleeps 
“Xaver is sleeping: 
We can therefore abstract away from any specific use of sleep’ and instead of, for exam- 
ple, max’ in (20), we use a variable (e.g., x). This x can then be replaced by paul’, mio’ 
or xaver’ in a given sentence. To allow us to access these variables in a given meaning, 


we can write them with a A in front. Accordingly, schläft ‘sleeps’ will have the following 
meaning: 

(22) Ax sleep'(x) 

The step from (20) to (22) is referred to as lambda abstraction. The combination of the 
expression (22) with the meaning of its arguments happens in the following way: we 
remove the A and the corresponding variable and then replace all instances of the variable 
with the meaning of the argument. If we combine (22) and max’ as in (23), we arrive at 
the meaning in (20), namely sleep’(max’). 

(23) Ax sleep'(x) max’ 


The process is called f-reduction or A-conversion. To show this further, let us consider 
an example with a transitive verb. The sentence in (24a) has the meaning given in (24b): 


(24) a. Max mag Lotte. 
Max likes Lotte 


*Max likes Lotte: 
b. like'(max', lotte") 


The A-abstraction of mag ‘likes’ is shown in (25): 


(25) AyÀx like (x, y) 
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Note that it is always the first A that has to be used first. The variable y corresponds to the 
object of mögen ‘to like’. For languages like English it is assumed that the object forms a 
verb phrase (VP) together with the verb and this VP is combined with the subject. Ger- 
man differs from English in allowing more freedom in constituent order. The problems 
that result for form meaning mappings are solved in different ways by different theories. 
The respective solutions will be addressed in the following chapters. 

If we combine the representation in (25) with that of the object Lotte, we arrive at 
(26a), and following f-reduction, (26b): 


(26) a. AyAx like’ (x, y)lotte’ 
b. Ax like’ (x, lotte") 


This meaning can in turn be combined with the subject and we then get (27a) and (27b) 
after f-reduction: 


(27) a. Ax like'(x, lotte") max 
b. like’(max’, lotte") 


After introducing lambda calculus, integrating the composition of meaning into our 
phrase structure rules is simple. A rule for the combination of a verb with its subject 
has to be expanded to include positions for the semantic contribution of the verb, the 
semantic contribution of the subject and then the meaning of the combination of these 
two (the entire sentence). The complete meaning is the combination of the individual 
meanings in the correct order. We can therefore take the simple rule in (28a) and turn it 
into (28b): 


(28) a. S— NP(nom) V 
b. S(V' NP’) > NP(nom, NP’) V(V’) 


V’ stands for the meaning of V and NP’ for the meaning of the NP(nom). V’ NP’ stands for 
the combination of V' and NP’. When analyzing (18a), the meaning of V’ is Ax sleep'(x) 
and the meaning of NP’ is max’. The combination of V’ NP’ corresponds to (29a) or after 
f-reduction to (18b) — repeated here as (29b): 


(29) a. Ax sleep (x) max 
b. sleep'(max') 


For the example with a transitive verb in (24a), the rule in (30) can be proposed: 
(30) S(V' NP2' NP1’) > NP(nom, NP1’) V(V^ NP(acc, NP2’) 


The meaning of the verb (V’) is first combined with the meaning of the object (NP2’) and 
then with the meaning of the subject (NP1’). 

At this point, we can see that there are several distinct semantic rules for the phrase 
structure rules above. The hypothesis that we should analyze language in this way is 
called the rule-to-rule hypothesis (Bach 1976: 184). A more general process for deriving 
the meaning of linguistic expression will be presented in Section 5.1.4. 
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2.4 Phrase structure rules for some aspects of German 
syntax 


Whereas determining the direct constituents of a sentence is relative easy, since we can 
very much rely on the movement test due to the somewhat flexible order of constituents 
in German, it is more difficult to identify the parts of the noun phrase. This is the problem 
we will focus on in this section. To help motivate assumptions about X syntax to be 
discussed in Section 2.5, we will also discuss prepositional phrases. 


2.4.1 Noun phrases 


Up to now, we have assumed a relatively simple structure for noun phrases: our rules 
state that a noun phrase consists of a determiner and a noun. Noun phrases can have a 
distinctly more complex structure than (31a). This is shown by the following examples 
in (31): 


(31) a. eine Frau 
a woman 


b. eine Frau, die wir kennen 
a woman who we know 


c. eine Frau aus Stuttgart 
a woman from Stuttgart 


d. eine kluge Frau 
a smart woman 


e. eine Frau aus Stuttgart, die wir kennen 
a woman from Stuttgart who we know 


f. eine kluge Frau aus Stuttgart 
a smart woman from Stuttgart 


g. eine kluge Frau, die wir kennen 
a smart woman who we know 


h. eine kluge Frau aus Stuttgart, die wir kennen 
a smart woman from Stuttgart who we know 


As well as determiners and nouns, noun phrases can also contain adjectives, preposi- 
tional phrases and relative clauses. The additional elements in (31) are adjuncts. They 
restrict the set of objects which the noun phrase refers to. Whereas (31a) refers to a be- 
ing which has the property of being a woman, the referent of (31b) must also have the 
property of being known to us. 

Our previous rules for noun phrases simply combined a noun and a determiner and 
can therefore only be used to analyze (31a). The question we are facing now is how we 
can modify this rule or which additional rules we would have to assume in order to 
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analyze the other noun phrases in (31). In addition to rule (32a), one could propose a rule 
such as the one in (32b).*? 
(32) a. NP—DetN 

b. NP > Det AN 


However, this rule would still not allow us to analyze noun phrases such as (33): 
(33) alle weiteren schlagkraftigen Argumente 
all further strong arguments 


‘all other strong arguments’ 
In order to be able to analyze (33), we require a rule such as (34): 
(34) NP—DetAAN 


It is always possible to increase the number of adjectives in a noun phrase and setting an 
upper limit for adjectives would be entirely arbitrary. Even if we opt for the following 
abbreviation, there are still problems: 


(35) NP — Det A*N 


The asterisk in (35) stands for any number of iterations. Therefore, (35) encompasses 
rules with no adjectives as well as those with one, two or more. 

The problem is that according to the rule in (35) adjectives and nouns do not form a 
constituent and we can therefore not explain why coordination is still possible in (36): 


(36) alle [[großen Seeelefanten] und [grauen Eichhörnchen]] 
all big elephant.seals and grey squirrels 


‘all the big elephant seals and grey squirrels’ 


If we assume that coordination involves the combination of two or more word strings 
with the same syntactic properties, then we would have to assume that the adjective and 
noun form a unit. 

The following rules capture the noun phrases with adjectives discussed thus far: 


(37) a. NP Det N 
b NO AN 
c NON 


These rules state the following: a noun phrase consists of a determiner and a nominal 
element (N). This nominal element can consist of an adjective and a nominal element 
(37b), or just a noun (37c). Since N is also on the right-hand side of the rule in (37b), we 
can apply this rule multiple times and therefore account for noun phrases with multiple 
adjectives such as (33). Figure 2.3 on the next page shows the structure of a noun phrase 
without an adjective and that of a noun phrase with one or two adjectives. The adjective 


5See Eisenberg (2004: 238) for the assumption of flat structures in noun phrases. 
?There are, of course, other features such as gender and number, which should be part of all the rules 
discussed in this section. I have omitted these in the following for ease of exposition. 
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NP 
ge ee 
NP Det N 
ee E EE 
NP Det N A N 

Det N A N A N 

| | | 
N N N 

| | | 


ein Eichhórnchen ein graues Eichhórnchen ein grofes graues Eichhórnchen 


a squirrel a grey squirrel a big grey squirrel 


Figure 2.3: Noun phrases with differing numbers of adjectives 


grau ‘grey’ restricts the set of referents for the noun phrase. If we assume an additional 
adjective such as grof ‘big’, then it only refers to those squirrels who are grey as well as 
big. These kinds of noun phrases can be used in contexts such as the following: 


(38) A: Alle grauen Eichhörnchen sind groß. 
all grey squirrels are big 


‘All grey squirrels are big’ B: Nein, ich habe ein kleines graues Eichhörnchen 
no I havea small grey squirrel 

gesehen. 

seen 


‘No, I saw a small grey squirrel: 


We observe that this discourse can be continued with Aber alle kleinen grauen Eich- 
hórnchen sind krank ‘but all small grey squirrels are ill’ and a corresponding answer. 
The possibility to have even more adjectives in noun phrases such as ein kleines graues 
Eichhórnchen ‘a small grey squirrel’ is accounted for in our rule system in (37). In the 
rule (37b), N occurs on the left as well as the right-hand side of the rule. This kind of rule 
is referred to as recursive. 

We have now developed a nifty little grammar that can be used to analyze noun 
phrases containing adjectival modifiers. As a result, the combination of an adjective 
and noun is given constituent status. One may wonder at this point if it would not make 
sense to also assume that determiners and adjectives form a constituent, as we also have 
the following kind of noun phrases: 


(39) diese schlauen und diese neugierigen Frauen 
these smart and these curious women 
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Here, we are dealing with a different structure, however. Two full NPs have been con- 
joined and part of the first conjunct has been deleted. 


(40) diese schlauen Frauen und diese neugierigen Frauen 
these smart women and these curious women 


One can find similar phenomena at the sentence and even word level: 


(41) a. dass Peter dem Mann das Buch gibt und Maria der Frau die Schallplatte 
that Peter the man the book gives and Maria the woman the record 
gibt 
gives 
‘that Peters gives the book to the man and Maria the record to the woman’ 


b. be- und ent-laden 
PRT and PRT-load 


‘load and unload’ 


Thus far, we have discussed how we can ideally integrate adjectives into our rules for 
the structure of noun phrases. Other adjuncts such as prepositional phrases or relative 
clauses can be combined with N in an analogous way to adjectives: 


(42) a N—NPP 
b. N —> N relative clause 
With these rules and those in (37), it is possible - assuming the corresponding rules for 
PPs and relative clauses - to analyze all the examples in (31). 
(37c) states that it is possible for N to consist of a single noun. A further important 
rule has not yet been discussed: we need another rule to combine nouns such as Vater 
‘father’, Sohn ‘son’ or Bild ‘picture’, so-called relational nouns, with their arguments. 


Examples of these can be found in (43a-b). (43c) is an example of a nominalization of a 
verb with its argument: 


(43) a. der Vater von Peter 
the father of Peter 
"Peter's father 


b. dasBild vom Gleimtunnel 
the picture of.the Gleimtunnel 


‘the picture of the Gleimtunnel’ 


c. das Kommen des Installateurs 
the coming of.the plumber 


“the plumber’s visit’ 
The rule that we need to analyze (43a,b) is given in (44): 
(44) N—NPP 
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NP 
ie I I 
NP Det N 
era m E c 
Det N N PP 
19 as pU ee, 
N PP N PP 
EE 

das gl vom Gleimtunnel das a vom Gleimtunnel im Gropiusbau 
the picture ofthe Gleimtunnel the picture of.the Gleimtunnel in.the Gropiusbau 


Figure 2.4: Combination of a noun with PP complement vom Gleimtunnel to the right 
with an adjunct PP 


Figure 2.4 shows two structures with PP-arguments. The tree on the right also contains 
an additional PP-adjunct, which is licensed by the rule in (42a). 

In addition to the previously discussed NP structures, there are other structures where 
the determiner or noun is missing. Nouns can be omitted via ellipsis. (45) gives an 
example of noun phrases, where a noun that does not require a complement has been 
omitted. The examples in (46) show NPs in which only one determiner and complement 
of the noun has been realized, but not the noun itself. The underscore marks the position 
where the noun would normally occur. 


(45) a. eine kluge _ 
a smart 
‘a smart one’ 


b. eine kluge große _ 
a smart tall 


‘a smart tall one’ 


c. eine kluge _aus Hamburg 
a smart from Hamburg 


‘a smart one from Hamburg’ 


d. eine kluge _, die alle kennen 
a smart who everyone knows 


« , 
a smart one who everyone knows 


(46) a. (Nein, nicht der Vater von Klaus), der _ von Peter war gemeint. 
no not thefatherof Klaus the of Peter was meant 


*No, it wasn't the father of Klaus, but rather the one of Peter that was meant: 
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b. (Nein, nicht das Bild von der Stadtautobahn), das vom Gleimtunnel war 
no not thepictureof the motorway the ofthe Gleimtunnel was 
beeindruckend. 
impressive 
‘No, it wasn’t the picture of the motorway, but rather the one of the Gleimtun- 
nel that was impressive: 
c. (Nein, nicht das Kommen des Tischlers), das | des Installateurs ist 
no not the coming ofthe carpenter the of.the plumber is 
wichtig. 
important 
‘No, it isn’t the visit of the carpenter, but rather the visit of the plumber that 
is important’ 


In English, the pronoun one must often be used in the corresponding position,” but in 
German the noun is simply omitted. In phrase structure grammars, this can be described 
by a so-called epsilon production. These rules replace a symbol with nothing (47a). The 
rule in (47b) is an equivalent variant which is responsible for the term epsilon production: 
(47) a No 

b Noe 


The corresponding trees are shown in Figure 2.5. Going back to boxes, the rules in (47) 


NP 
me NP 
] Po 
ipu eL 
eine kluge | das vom Gleimtunnel 
a smart the of.the Gleimtunnel 


Figure 2.5: Noun phrases without an overt head 


correspond to empty boxes with the same labels as the boxes of ordinary nouns. As we 
have considered previously, the actual content of the boxes is unimportant when con- 
sidering the question of where we can incorporate them. In this way, the noun phrases 
in (31) can occur in the same sentences. The empty noun box also behaves like one with 


0 See Fillmore et al. (2012: Section 4.12) for English examples without the pronoun one. 
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a genuine noun. If we do not open the empty box, we will not be able to ascertain the 
difference to a filled box. 

It is not only possible to omit the noun from noun phrases, but the determiner can 
also remain unrealized in certain contexts. (48) shows noun phrases in plural: 


(48) a. Frauen 
women 


b. Frauen, die wir kennen 
women who we know 


c. kluge Frauen 
smart women 


d. kluge Frauen, die wir kennen 
smart women who we know 
The determiner can also be omitted in singular ifthe noun denotes a mass noun: 
(49) a. Getreide 
grain 
b. Getreide, das gerade gemahlen wurde 
grain thatjust ground was 
‘grain that has just been ground’ 


c. frisches Getreide 
fresh grain 


d. frisches Getreide, das gerade gemahlen wurde 
fresh grain thatjust ground was 


‘fresh grain that has just been ground’ 
Finally, both the determiner and the noun can be omitted: 
(50) a. Ichhelfe klugen. 

I help smart 

‘T help smart ones: 


b. Dort drüben steht frisches, das gerade gemahlen wurde. 
there over stands fresh thatjust ground was 


‘Over there is some fresh (grain) that has just been ground! 


Figure 2.6 on the next page shows the corresponding trees. 

It is necessary to add two further comments to the rules we have developed up to this 
point: up to now, I have always spoken of adjectives. However, it is possible to have 
very complex adjective phrases in pre-nominal position. These can be adjectives with 
complements (51a,b) or adjectival participles (51c,d): 


(51) a. der seiner Frau treue Mann 
the bis par wife faithful man 


‘the man faithful to his wife’ 
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NP 
FR 
NP Det N 
e a 
Det N AN 
| | 
N N 
| | 
Frauen — klugen _ 
women smart 


Figure 2.6: Noun phrases without overt determiner 


b. der auf seinen Sohn stolze Mann 
the on his.acc son proud man 


‘the man proud of his son’ 


c. der seine Frau liebende Mann 
the his.acc woman loving man 


‘the man who loves his wife’ 


d. der von seiner Frau geliebte Mann 
the by his.pat wife loved man 


‘the man loved by his wife’ 
Taking this into account, the rule (37b) has to be modified in the following way: 
(52) N>APN 


An adjective phrase (AP) can consist of an NP and an adjective, a PP and an adjective or 
just an adjective: 


(53) a. AP—NPA 
b AP— PPA 
c. APO A 


There are two imperfections resulting from the rules we have developed thus far. These 
are the rules for adjectives or nouns without complements in (53c) as well as (37c) - 
repeated here as (54): 


(54) NON 


If we apply these rules, then we will generate unary branching subtrees, that is trees 
with a mother that only has one daughter. See Figure 2.6 for an example of this. If we 
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maintain the parallel to the boxes, this would mean that there is a box which contains 
another box which is the one with the relevant content. 

In principle, nothing stops us from placing this information directly into the larger 
box. Instead of the rules in (55), we will simply use the rules in (56): 


(55) a. A kluge 
b N Mann 

(56) a. AP > kluge 
b. N Mann 


(56a) states that kluge ‘smart’ has the same properties as a full adjective phrase, in partic- 
ular that it cannot be combined with a complement. This is parallel to the categorization 
of the pronoun er ‘he’ as an NP in the grammars (2) and (6). 

Assigning N to nouns which do not require a complement has the advantage that we 
do not have to explain why the analysis in (57b) is possible as well as (57a) despite there 
not being any difference in meaning. 


(57) a. [np einige [y kluge Le [s [n Frauen ] und [5 [n Manner ]]]]]] 
some smart women and men 


b. [np einige [x kluge [x [n In Frauen ] und [„ Manner ]]]]] 
some smart women and men 


In (57a), two nouns have projected to N and have then been joined by coordination. 
The result of coordination of two constituents of the same category is always a new 
constituent with that category. In the case of (57a), this is also N. This constituent is then 
combined with the adjective and the determiner. In (57b), the nouns themselves have 
been coordinated. The result of this is always another constituent which has the same 
category as its parts. In this case, this would be N. This N becomes N and is then combined 
with the adjective. If nouns which do not require complements were categorized as N 
rather than N, we would not have the problem of spurious ambiguities. The structure in 
(58) shows the only possible analysis. 


(58) [np einige [5 kluge [5 [x Frauen ] und [x Männer ]]]] 
some smart women and men 
2.4.2 Prepositional phrases 


Compared to the syntax of noun phrases, the syntax of prepositional phrases (PPs) is 
relatively straightforward. PPs normally consist of a preposition and a noun phrase 
whose case is determined by that preposition. We can capture this with the following 
rule: 


(59) PP — PNP 


This rule must, of course, also contain information about the case of the NP. I have 
omitted this for ease of exposition as I did with the NP-rules and AP-rules above. 
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The Duden grammar (Eisenberg et al. 2005: §1300) offers examples such as those in 
(60), which show that certain prepositional phrases serve to further define the semantic 
contribution of the preposition by indicating some measurement, for example: 


(60) a. [[Einen Schritt] vor dem Abgrund] blieb er stehen. 
one step before the abyss remained he stand 


‘He stopped one step in front of the abyss’ 


b. [[Kurz] nach dem Start] fiel die Klimaanlage aus. 
shortly after the take.off fell the air.conditioning out 


‘Shortly after take off, the air conditioning stopped working: 


c. [[Schrág] hinter der Scheune] ist ein Weiher. 
diagonally behind the barn is a pond 


“There is a pond diagonally across from the barn’ 


d. [[Mitten] im Urwald] stießen die Forscher auf einen alten Tempel. 
middle in.the jungle stumbled the researcherson an old temple 


‘In the middle of the jungle, the researches came across an old temple’ 
To analyze the sentences in (60a,b), one could propose the following rules in (61): 


(61) a. PP — NP PP 
b. PP > AP PP 


These rules combine a PP with an indication of measurement. The resulting constituent 
is another PP. It is possible to use these rules to analyze prepositional phrases in (60a,b), 
but it unfortunately also allows us to analyze those in (62): 


(62) a. * [pp einen Schritt [pp kurz [pp vor dem Abgrund]]] 
one step shortly before the abyss 


b. *[pp kurz [pp einen Schritt [pp vor dem Abgrund]]] 
shortly one step before the abyss 


Both rules in (61) were used to analyze the examples in (62). Since the symbol PP occurs 
on both the left and right-hand side of the rules, we can apply the rules in any order and 
as many times as we like. 

We can avoid this undesired side-effect by reformulating the previously assumed rules: 


(63) a. PP>NPP 
b. PP > APP 
c. PPP 
d. P>PNP 
Rule (59) becomes (63d). The rule in (63c) states that a PP can consist of P. Figure 2.7 on 


the facing page shows the analysis of (64) using (63c) and (63d) as well as the analysis of 
an example with an adjective in the first position following the rules in (63b) and (63d): 
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(64) vor dem Abgrund 
before the abyss 


‘in front of the abyss’ 


PP PP 
: AP P 
— po 
Se Be 
4 dem Abgrund kurz vor dem Abgrund 
before the abyss shortly before the abyss 


Figure 2.7: Prepositional phrases with and without measurement 


At this point, the attentive reader is probably wondering why there is no empty mea- 
surement phrase in the left figure of Figure 2.7, which one might expect in analogy to the 
empty determiner in Figure 2.6. The reason for the empty determiner in Figure 2.6 is that 
the entire noun phrase without the determiner has a meaning similar to those with a de- 
terminer. The meaning normally contributed by the visible determiner has to somehow 
be incorporated in the structure of the noun phrase. If we did not place this meaning in 
the empty determiner, this would lead to more complicated assumptions about semantic 
combination: we only really require the mechanisms presented in Section 2.3 and these 
are very general in nature. The meaning is contributed by the words themselves and not 
by any rules. If we were to assume a unary branching rule such as that in the left tree in 
Figure 2.7 instead of the empty determiner, then this unary branching rule would have 
to provide the semantics of the determiner. This kind of analysis has also been proposed 
by some researchers. See Chapter 19 for more on empty elements. 

Unlike determiner-less NPs, prepositional phrases without an indication of degree or 
measurement do not lack any meaning component for composition. It is therefore not 
necessary to assume an empty indication of measurement, which somehow contributes 
to the meaning of the entire PP. Hence, the rule in (63c) states that a prepositional phrase 
consists of P, that is, a combination of P and NP. 


2.5 X theory 


If we look again at the rules that we have formulated in the previous section, we see that 
heads are always combined with their complements to form a new constituent (65a,b), 
which can then be combined with further constituents (65c,d): 


(5) a. NO NPP 
b P>PNP 
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c. NP > Det N 
d. PP— NP P 


Grammarians working on English noticed that parallel structures can be used for phrases 
which have adjectives or verbs as their head. I discuss adjective phrases at this point and 
postpone the discussion of verb phrases to Chapter 3. As in German, certain adjectives in 
English can take complements with the important restriction that adjective phrases with 
complements cannot realize these pre-nominally in English. (66) gives some examples 
of adjective phrases: 


(66) a. Heis proud. 
b. He is very proud. 
c. He is proud of his son. 


d. He is very proud of his son. 


Unlike prepositional phrases, complements of adjectives are normally optional. proud 
can be used with or without a PP. The degree expression very is also optional. 

The rules which we need for this analysis are given in (67), with the corresponding 
structures in Figure 2.8. 


(67) a. AP>A 

b. AP AdvP A 

c. A APP 

d AA 

AP AP AP AP 

A AdvP A A AdvP A 

A A A PP A PP 

proud very proud proud of his son very proud of his son 


Figure 2.8: English adjective phrases 


As was shown in Section 2.2, it is possible to generalize over very specific phrase 
structure rules and thereby arrive at more general rules. In this way, properties such as 
person, number and gender are no longer encoded in the category symbols, but rather 
only simple symbols such as NP, Det and N are used. It is only necessary to specify 
something about the values of a feature if it is relevant in the context of a given rule. We 
can take this abstraction a step further: instead of using explicit category symbols such 
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as N, V, P and A for lexical categories and NP, VP, PP and AP for phrasal categories, one 
can simply use a variable for the word class in question and speak of X and XP. 

This form of abstraction can be found in so-called X theory (or X-bar theory, the term 
bar refers to the line above the symbol), which was developed by Chomsky (1970) and 
refined by Jackendoff (1977). This form of abstract rules plays an important role in many 
different theories. For example: Government & Binding (Chapter 3), Generalized Phrase 
Structure Grammar (Chapter 5) and Lexical Functional Grammar (Chapter 7). In HPSG 
(Chapter 9), X theory also plays a role, but not all restrictions of the X schema have been 
adopted. 

(68) shows a possible instantiation of X rules, where the category X has been used in 
place of N, as well as examples of word strings which can be derived by these rules: 


(68) X rule with specific categories example strings 
X> specifier X N DET N the [picture of Paris] 
X 5 X adjunct N — N REL CLAUSE [picture of Paris] 


[that everybody knows] 


X — adjunct X N-AN beautiful [picture of Paris] 


X 9 X complements N—N P picture [of Paris] 


Any word class can replace X (e.g., V, A or P). The X without the bar stands for a lexical 
item in the above rules. If one wants to make the bar level explicit, then it is possible 
to write X°. Just as with the rule in (15), where we did not specify the case value of the 
determiner or the noun but rather simply required that the values on the right-hand side 
of the rule match, the rules in (68) require that the word class of an element on the right- 
hand side of the rule (X or X) matches that of the element on the left-hand side of the 
rule (X or X). 

A lexical element can be combined with all its complements. The * in the last rule 
stands for an unlimited amount of repetitions of the symbol it follows. A special case is 
zerofold occurrence of complements. There is no PP complement of Bild ‘picture’ present 
in das Bild ‘the picture’ and thus N becomes N. The result of the combination of a lexical 
element with its complements is a new projection level of X: the projection level 1, which 
is marked by a bar. X can then be combined with adjuncts. These can occur to the left 
or right of X. The result of this combination is still X, that is the projection level is not 
changed by combining it with an adjunct. Maximal projections are marked by two bars. 
One can also write XP for a projection of X with two bars. An XP consists of a specifier 
and X. Depending on one’s theoretical assumptions, subjects of sentences (Haider 1995; 
1997a; Berman 2003a: Section 3.2.2) and determiners in NPs (Chomsky 1970: 210) are 
specifiers. Furthermore, degree modifiers (Chomsky 1970: 210) in adjective phrases and 
measurement indicators in prepositional phrases are also counted as specifiers. 

Non-head positions can only host maximal projections and therefore complements, 
adjuncts and specifiers always have two bars. Figure 2.9 on the following page gives an 
overview of the minimal and maximal structure of phrases. 
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XP 
XP specifier X 
adjunct X 


ndis 


complement X 


xK — x— 


Figure 2.9: Minimal and maximal structure of phrases 


Some categories do not have a specifier or have the option of having one. Adjuncts are 
optional and therefore not all structures have to contain an X with an adjunct daughter. 
In addition to the branching shown in the right-hand figure, adjuncts to XP and head- 
adjuncts are sometimes possible. There is only a single rule in (68) for cases in which a 
head precedes the complements, however an order in which the complement precedes 
the head is of course also possible. This is shown in Figure 2.9. 

Figure 2.10 on the next page shows the analysis of the NP structures das Bild ‘the pic- 
ture’ and das schöne Bild von Paris ‘the beautiful picture of Paris’. The NP structures in 
Figure 2.10 and the tree for proud in Figure 2.8 show examples of minimally populated 
structures. The left tree in Figure 2.10 is also an example of a structure without an ad- 
junct. The right-hand structure in Figure 2.10 is an example for the maximally populated 
structure: specifier, adjunct, and complement are present. 

The analysis given in Figure 2.10 assumes that all non-heads in a rule are phrases. 
One therefore has to assume that there is a determiner phrase even if the determiner is 
not combined with other elements. The unary branching of determiners is not elegant 
but it is consistent.” The unary branchings for the NP Paris in Figure 2.10 may also 
seem somewhat odd, but they actually become more plausible when one considers more 
complex noun phrases: 


(69) a. dasParis der dreißiger Jahre 
the Paris ofthe thirty years 
*30's Paris’ 
b. die Mariaaus Hamburg 


the Maria from Hamburg 


"Maria from Hamburg’ 


Unary projections are somewhat inelegant but this should not concern us too much 
here, as we have already seen in the discussion of the lexical entries in (56) that unary 
branching nodes can be avoided for the most part and that it is indeed desirable to avoid 
such structures. Otherwise, one gets spurious ambiguities. In the following chapters, we 


"For an alternative version of X theory which does not assume elaborate structure for determiners see 
Muysken (1982). 
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NP 
pM 
DetP N 
rca eee 
Det AP N 
Det A m 
| 
NP A P 
a SE 
Det N N 
Det N 
das Bild das schöne Bild von Paris 
the picture the beautiful picture of Paris 


Figure 2.10: X analysis of das Bild ‘the picture’ and das schöne Bild von Paris ‘the beautiful 
picture of Paris’ 


will discuss approaches such as Categorial Grammar and HPSG, which do not assume 
unary rules for determiners, adjectives and nouns. 

Furthermore, other X theoretical assumptions will not be shared by several theories 
discussed in this book. In particular, the assumption that non-heads always have to 
be maximal projections will be disregarded. Pullum (1985) and Kornai & Pullum (1990) 
have shown that the respective theories are not necessarily less restrictive than theories 
which adopt a strict version of the X theory. See also the discussion in Section 13.1.2. 
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Comprehension questions 


1. Why are phrase structure grammars that use only atomic categories inade- 
quate for the description of natural languages? 


2. Assuming the grammar in (6), state which steps (replacing symbols) one has 
to take to get to the symbol V in the sentence (70). 


(70) er das Buch dem Mann gibt 
he the book the man gives 


‘He gives the book to the man’ 


Your answer should resemble the analysis in (3). 


3. Give a representation of the meaning of (71) using predicate logic: 


(71) a. Ulrike kennt Hans. 
Ulrike knows Hans 


b. Joshi freut sich. 
Joshi is.happy REFL 


‘Joshi is happy? 


Exercises 


1. On page 55, I claimed that there is an infinite number of grammars we could 
use to analyze (1). Why is this claim correct? 


2. Try to come up with some ways in which we can tell which of these possible 
grammars is or are the best? 


3. A fragment for noun phrase syntax was presented in Section 2.4.1. Why is the 
interaction of the rules in (72) problematic? 


2.5 X Theory 


NP — Det N 
NON 
Det > € 
d. Ne 


(72) 


ep 


‘© 


. Why is it not a good idea to mark books as NP in the lexicon? 


. Can you think of some reasons why it is not desirable to assume the following 
rule for nouns such as books: 


(73) NP — Modifier* books Modifier* 


The rule in (73) combines an unlimited number of modifiers with the noun 
books followed by an unlimited number of modifiers. We can use this rule to 
derive phrases such as those in (74): 


(74) a. books 
b. interesting books 


c. interesting books from Stuttgart 


Make reference to coordination data in your answer. Assume that symmetric 
coordination requires that both coordinated phrases or words have the same 
syntactic category. 


. Fillmore, Lee-Goldmann & Rhomieux (2012) suggested treating nounless struc- 
tures like those in (75) as involving a phrasal construction combining the de- 
terminer the with an adjective. 


(75) a. Examine the plight of the very poor. 
b. Their outfits range from the flamboyant to the functional. 


c. Ihe unimaginable happened. 
(76) shows a phrase structure rule that corresponds to their construction: 
(76) NP — the Adj 


Adj stands for something that can be a single word like poor or complex like 
very poor. 


Revisit the German data in (45) and (46) and explain why such an analysis and 
even a more general one as in (77) would not extend to German. 
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(77) NP — Det Adj 


. Why can X theory not account for German adjective phrases without addi- 


tional assumptions? (This task is for (native) speakers of German only.) 


. Come up with a phrase structure grammar that can be used to analyze the 


sentence in (78), but also rules out the sentences in (79). 


(78) a. Der Mann hilft dem Kind. 

the.nom man helps the.par child 
"Ihe man helps the child’ 

b. Er gibt ihr das Buch. 
he.NoM gives her.DAT the book 
‘He gives her the book. 

& Bir wartet auf ein Wunder. 
he.NoM waits on a miracle 


‘He is waiting for a miracle’ 


(79) a. *Der Mann hilft er. 
the.nom man helps he.NoM 


b. *Er gibt ihr den Buch. 
he.NoM gives her.DAT the.m book.n 


. Consider which additional rules would have to be added to the grammar you 


developed in the previous exercise in order to be able to analyze the following 
sentences: 


(80) a. Der Mannhilft dem Kind jetzt. 
the.nom man helps the.DAT child now 


"Ihe man helps the child now? 


b. Der Mann hilft dem Kind neben dem Bushäuschen. 
the.nom man helps the.pAT child next.to the bus.shelter 


"Ihe man helps the child next to the bus shelter. 


c. Er gibt ihr das Buch jetzt. 
he.NoM gives ber pat the.acc book now 


"He gives her the book now: 


d. Er gibt ihr das Buch neben dem Bushäuschen. 
he.NoM gives her.DAT the.acc book next.to the bus.shelter 


‘He gives her the book next to the bus shelter: 


2.5 X theory 


e. Er wartet jetzt auf ein Wunder. 
he.NoM waits nowon a miracle 


‘He is waiting for a miracle now. 


f. Er wartet neben dem Bushauschen auf ein Wunder. 
he.NoM waits next.to the.DAT bus.shelter on a miracle 


‘He is waiting for a miracle next to the bus shelter: 


10. Install a Prolog system (e.g., SWI-Prolog^) and try out your grammar. SWI- 
Prolog also comes with an online version where you can input your grammar 
directly without any installation." Details regarding the notation can be found 
in the English Wikipedia entry for Definite Clause Grammar (DCG).° 

“http://www.swi-prolog.org, 2018-02-20. 


Phttps://swish.swi-prolog.org/, 2018-02-20. 
“https://en.wikipedia.org/wiki/Definite_clause_grammar, 2018-02-20. 


Further reading 


The expansion of phrase structure grammars to include features was proposed as 
early as 1963 by Harman (1963). 

The phrase structure grammar for noun phrases discussed in this chapter cov- 
ers a large part of the syntax of noun phrases but cannot explain certain NP struc- 
tures. Furthermore, it has the problem, which exercise 3 is designed to show. A 
discussion of these phenomena and a solution in the framework of HPSG can be 
found in Netter (1998) and Kiss (2005). 

The discussion of the integration of semantic information into phrase structure 
grammars was very short. A detailed discussion of predicate logic and its integra- 
tion into phrase structure grammars - as well as a discussion of quantifier scope 
— can be found in Blackburn & Bos (2005). 
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3 Transformational Grammar - 
Government & Binding 


Transformational Grammar and its subsequent incarnations (such as Government and 
Binding Theory and Minimalism) were developed by Noam Chomsky at MIT in Boston 
(Chomsky 1957; 1965; 1975; 1981a; 1986a; 1995b). Manfred Bierwisch (1963) was the first to 
implement Chomsky’s ideas for German. In the 60s, the decisive impulse came from the 
Arbeitsstelle Strukturelle Grammatik “Workgroup for Structural Grammar’, which was 
part of the Academy of Science of the GDR. See Bierwisch 1992 and Vater 2010 for a 
historic overview. As well as Bierwisch’s work, the following books focusing on German 
or the Chomskyan research program in general should also be mentioned: Fanselow 
(1987), Fanselow & Felix (1987), von Stechow & Sternefeld (1988), Grewendorf (1988), 
Haider (1993), Sternefeld (2006). 

The different implementations of Chomskyan theories are often grouped under the 
heading Generative Grammar. This term comes from the fact that phrase structure gram- 
mars and the augmented frameworks that were suggested by Chomsky can generate sets 
of well-formed expressions (see p. 54). It is such a set of sentences that constitutes a lan- 
guage (in the formal sense) and one can test if a sentence forms part of a language by 
checking if a particular sentence is in the set of sentences generated by a given gram- 
mar. In this sense, simple phrase structure grammars and, with corresponding formal 
assumptions, GPSG, LFG, HPSG and Construction Grammar (CxG) are generative theo- 
ries. In recent years, a different view of the formal basis of theories such as LFG, HPSG 
and CxG has emerged such that the aforementioned theories are now model theoretic 
theories rather than generative-enumerative ones! (See Chapter 14 for discussion). In 
1965, Chomsky defined the term Generative Grammar in the following way (see also 
Chomsky 1995b: 162): 


A grammar of a language purports to be a description of the ideal speaker-hearer’s 
intrinsic competence. If the grammar is, furthermore, perfectly explicit - in other 
words, if it does not rely on the intelligence of the understanding reader but rather 
provides an explicit analysis of his contribution — we may call it (somewhat redun- 
dantly) a generative grammar. (Chomsky 1965: 4) 


In this sense, all grammatical theories discussed in this book would be viewed as gen- 
erative grammars. To differentiate further, sometimes the term Mainstream Generative 
Grammar (MGG) is used (Culicover & Jackendoff 2005: 3) for Chomskyan models. In this 


! Model theoretic approaches are always constraint-based and the terms model theoretic and constraint-based 
are sometimes used synonymously. 
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chapter, I will discuss a well-developed and very influential version of Chomskyan gram- 
mar, GB theory. More recent developments following Chomsky’s Minimalist Program 
are dealt with in Chapter 4. 


3.1 General remarks on the representational format 


This section provides an overview of general assumptions. I introduce the concept of 
transformations in Section 3.1.1. Section 3.1.2 provides background information about 
assumptions regarding language acquisition, which shaped the theory considerably, Sec- 
tion 3.1.3 introduces the so-called T model, the basic architecture of GB theory. Sec- 
tion 3.1.4 introduces the X theory in the specific form used in GB and Section 3.1.5 shows 
how this version of the X theory can be applied to English. The discussion of the analysis 
of English sentences is an important prerequisite for the understanding of the analysis 
of German, since many analyses in the GB framework are modeled in parallel to the 
analyses of English. Section 3.1.6 introduces the analysis of German clauses in a parallel 
way to what has been done for English in Section 3.1.5. 


3.1.1 Transformations 


In the previous chapter, I introduced simple phrase structure grammars. Chomsky (1957: 
Chapter 5) criticized this kind of rewrite grammars since - in his opinion - it is not clear 
how one can capture the relationship between active and passive sentences or the vari- 
ous ordering possibilities of constituents in a sentence. While it is of course possible to 
formulate different rules for active and passive sentences in a phrase structure grammar 
(e.g., one pair of rules for intransitive (1), one for transitive (2) and one for ditransitive 
verbs (3)), it would not adequately capture the fact that the same phenomenon occurs in 
the example pairs in (1)-(3): 
(1) a. weil dort noch jemand arbeitet 
because there still somebody works 
‘because somebody is still working there’ 


b. weil dort noch gearbeitet wurde 
because there still worked was 


‘because work was still being done there’ 


(2 a. weil er den Weltmeister schlägt 
because he the world.champion beats 


‘because he beats the world champion’ 


b. weil der Weltmeister geschlagen wurde 
because the world.champion beaten was 


‘because the world champion was beaten’ 


84 


3.1 General remarks on the representational format 


(3) a. weil der Mann der Frau den Schlüssel stiehlt 
because the man the woman the key steals 
‘because the man is stealing the key from the woman’ 

b. weil ` der Frau der Schlüssel gestohlen wurde 
because the woman the key stolen was 


‘because the key was stolen from the woman’ 


Chomsky (1957: 43) suggests a transformation that creates a connection between active 
and passive sentences. The transformation that he suggests for English corresponds to 
(4), which is taken from Klenk (2003: 74): 


(4) NP V NP > 3 [aux be] Zen [pp [p by] 1] 
1 23 


This transformational rule maps a tree with the symbols on the left-hand side of the rule 
onto a tree with the symbols on the right-hand side of the rule. Accordingly, 1, 2 and 
3 on the right of the rule correspond to symbols, which are under the numbers on the 
left-hand side. en stands for the morpheme which forms the participle (seen, been, ..., but 
also loved). Both trees for (5a,b) are shown in Figure 3.1. 


(5) a. John loves Mary. 
b. Mary is loved by John. 


S 
p od ee, 
S NP VP 
NP VP = Mary Aux V PP 
|! a os a 
John V NP P NP 
| | | d 
loves Mary is loved by John 


Figure 3.1: Application of passive transformation 


The symbols on the left of transformational rules do not necessarily have to be in a local 
tree, that is, they can be daughters of different mothers as in Figure 3.1. 

Rewrite grammars were divided into four complexity classes based on the properties 
they have. The simplest grammars are assigned to the class 3, whereas the most complex 
are of Type-0. The so-called context-free grammars we have dealt with thus far are of 
Type-2. Transformational grammars which allow symbols to be replaced by arbitrary 
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other symbols are of Type-0 (Peters & Ritchie 1973). Research on the complexity of natu- 
ral languages shows that the highest complexity level (Type-0) is too complex for natural 
language. It follows from this - assuming that one wants to have a restrictive formal ap- 
paratus for the description of grammatical knowledge (Chomsky 1965: 62) - that the 
form and potential power of transformations has to be restricted.” Another criticism of 
early versions of transformational grammar was that, due to a lack of restrictions, the 
way in which transformations interact was not clear. Furthermore, there were problems 
associated with transformations which delete material (see Peters & Ritchie 1973; Klenk 
2003: Section 3.1.4). For this reason, new theoretical approaches such as Government 
& Binding (Chomsky 1981a) were developed. In this model, the form that grammatical 
rules can take is restricted (see Section 3.1.4). Elements moved by transformations are 
still represented in their original position, which makes them recoverable at the original 
position and hence the necessary information is available for semantic interpretation. 
There are also more general principles, which serve to restrict transformations. 

After some initial remarks on the model assumed for language acquisition in GB the- 
ory, we will take a closer look at phrase structure rules, transformations and constraints. 


3.1.2 The hypothesis regarding language acquisition: Principles & 
Parameters 


Chomsky (1965: Section I.8) assumes that linguistic knowledge must be innate since the 
language system is, in his opinion, so complex that it would be impossible to learn a 
language from the given input using more general cognitive principles alone (see also 
Section 13.8). If it is not possible to learn language solely through interaction with our 
environment, then at least part of our language ability must be innate. The question 
of exactly what is innate and if humans actually have an innate capacity for language 
remains controversial and the various positions on the question have changed over the 
course of the last decades. Some notable works on this topic are Pinker (1994), Tomasello 
(1995), Wunderlich (2004), Hauser, Chomsky & Fitch (2002), Chomsky (2007), and Pul- 
lum & Scholz (2001) and other papers in the same volume. For more on this discussion, 
see Chapter 13. 

Chomsky (1981a) also assumes that there are general, innate principles which linguis- 
tic structure cannot violate. These principles are parametrized, that is, there are options. 
Parameter settings can differ between languages. An example for a parametrized princi- 
ple is shown in (6): 


(6) Principle: A head occurs before or after its complement(s) depending on the value 
of the parameter POSITION. 


The Principles & Parameters model (P&P model) assumes that a significant part of lan- 
guage acquisition consists of extracting enough information from the linguistic input in 
order to be able to set parameters. Chomsky (2000: 8) compares the setting of parameters 
to flipping a switch. For a detailed discussion of the various assumptions about language 


?For more on the power of formal languages, see Chapter 17. 
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acquisition in the P&P-model, see Chapter 21.6. Speakers of English have to learn that 
heads occur before their complements in their language, whereas a speaker of Japanese 
has to learn that heads follow their complements. (7) gives the respective examples: 


(7) a. be showing pictures of himself 
b. zibun -no syasin-o mise-te iru 
REFL from picture showing be 


As one can see, the Japanese verb, noun and prepositional phrases are a mirror image of 
the corresponding phrases in English. (8) provides a summary and shows the parametric 
value for the position parameter: 


(8) Language Observation Parameter: head initial 
English Heads occur before complements + 
Japanese Heads occur after complements -= 


Investigating languages based on their differences with regard to certain assumed pa- 
rameters has proven to be a very fruitful line of research in the last few decades and has 
resulted in an abundance of comparative cross-linguistic studies. 

After these introductory comments on language acquisition, the following sections 
will discuss the basic assumptions of GB theory. 


3.1.3 The T model 


Chomsky criticized simple PSGs for not being able to adequately capture certain corre- 
lations. An example of this is the relationship between active and passive sentences. In 
phrase structure grammars, one would have to formulate active and passive rules for 
intransitive, transitive and ditransitive verbs (see the discussion of (1)-(3) above). The 
fact that the passive can otherwise be consistently described as the suppression of the 
most prominent argument is not captured by phrase structure rules. Chomsky there- 
fore assumes that there is an underlying structure, the so-called Deep Structure, and that 
other structures are derived from this. The general architecture of the so-called T model 
is discussed in the following subsections. 


3.1.3.1 D-structure and S-structure 


During the derivation of new structures, parts of the Deep Structure can be deleted or 
moved. In this way, one can explain the relationship between active and passive sen- 
tences. As the result of this kind of manipulation of structures, also called transforma- 
tions, one derives a new structure, the Surface Structure, from the original Deep Struc- 
ture. Since the Surface Structure does not actually mirror the actual use of words in a 
sentence in some versions of the theory, the term S-structure is sometimes used instead 
as to avoid misunderstandings. 


(9) Surface Structure = S-structure 
Deep Structure = D-structure 
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Figure 3.2 gives an overview of the GB architecture: phrase structure rules and the lexi- 
con license the D-structure from which the S-structure is derived by means of transfor- 
mations. S-structure feeds into Phonetic Form (PF) and Logical Form (LF). The model is 


D-structure 
nove a 


S-structure 


"ui 


Deletion rules, Anaphoric rules, 
Filter, phonol.rules rules of quantification and control 


| | 


Phonetic Logical 
Form (PF) Form (LF) 


Figure 3.2: The T model 


referred to as the T-model (or Y-model) because D-structure, S-structure, PF and LF form 
an upside-down T (or Y). We will look at each of these individual components in more 
detail. 

Using phrase structure rules, one can describe the relationships between individual 
elements (for instance words and phrases, sometimes also parts of words). The format 
for these rules is X syntax (see Section 2.5). The lexicon, together with the structure 
licensed by X syntax, forms the basis for D-structure. D-structure is then a syntactic 
representation of the selectional grid (= valence classes) of individual word forms in the 
lexicon. 

The lexicon contains a lexical entry for every word which comprises information 
about morphophonological structure, syntactic features and selectional properties. This 
will be explained in more detail in Section 3.1.3.4. Depending on one’s exact theoreti- 
cal assumptions, morphology is viewed as part of the lexicon. Inflectional morphology 
is, however, mostly consigned to the realm of syntax. The lexicon is an interface for 
semantic interpretation of individual word forms. 

The surface position in which constituents are realized is not necessarily the position 
they have in D-structure. For example, a sentence with a ditransitive verb has the fol- 
lowing ordering variants: 


(10) a. [dass] der Mann der Frau das Buch gibt 
that the.nom man the.DAT woman the.acc book gives 


‘that the man gives the woman the book’ 
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b. Gibt der Mann der Frau das Buch? 
gives the.nom man the.DAT woman the.acc book 


‘Does the man give the woman the book?’ 


c. Der Mann gibt der Frau das Buch. 
the.nom man gives the DAT woman the.acc book 


‘The man gives the woman the book’ 


The following transformational rules for the movements above are assumed: (10b) is 
derived from (10a) by fronting the verb, and (10c) is derived from (10b) by fronting the 
nominative noun phrase. In GB theory, there is only one very general transformation: 
Move a = “Move anything anywhere!”. The nature of what exactly can be moved where 
and for which reason is determined by principles. Examples of such principles are the 
Theta-Criterion and the Case Filter, which will be dealt with below. 

The relations between a predicate and its arguments that are determined by the lexical 
entries have to be accessible for semantic interpretation at all representational levels. For 
this reason, the base position of a moved element is marked with a trace. This means, 
for instance, that the position in which the fronted gibt ‘gives’ originated is indicated in 
(11b). The respective marking is referred to as a trace or a gap. Such empty elements may 
be frightening when one encounters them first, but I already motivated the assumption 
of empty elements in nominal structures in Section 2.4.1 (page 68). 


(11) a. [dass] der Mann der Frau das Buch gibt 
that the man the woman the book gives 


‘that the man gives the woman the book’ 
b. Gibt; der Mann der Frau das Buch ;? 


H 
gives the man the woman the book 


‘Does the man give the woman the book?’ 
c. [Der Mann]; gibt; _; der Frau das Buch ;. 
the man gives the woman the book 


"Ihe man gives the woman the book’ 


(11c) is derived from (11a) by means of two movements, which is why there are two traces 
in (11c). The traces are marked with indices so it is possible to distinguish the moved 
constituents. The corresponding indices are then present on the moved constituents. 
Sometimes, e (for empty) or t (for trace) is used to represent traces. 

The S-structure derived from the D-structure is a surface-like structure but should not 
be equated with the structure of actual utterances. 


3.1.3.2 Phonetic Form 


Phonological operations are represented at the level of Phonetic Form (PF). PF is respon- 
sible for creating the form which is actually pronounced. For example, so-called wanna- 
contraction takes place at PF (Chomsky 1981a: 20-21). 
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(12) a. The students want to visit Paris. 


b. The students wanna visit Paris. 
The contraction in (12) is licensed by the optional rule in (13): 


(13) want + to — wanna 


3.1.3.3 Logical Form 


Logical Form is the syntactic level which mediates between S-structure and the semantic 
interpretation of a sentence. Some of the phenomena which are dealt with by LF are 
anaphoric reference of pronouns, quantification and control. 

Syntactic factors play a role in resolving anaphoric dependencies. An important com- 
ponent of GB theory is Binding Theory, which seeks to explain what a pronoun can 
or must refer to and when a reflexive pronoun can or must be used. (14) gives some 
examples of both personal and reflexive pronouns: 


(14) a. Peter kauft einen Tisch. Er gefällt ihm. 
Peter buys a table(m) he likes him 


‘Peter is buying a table. He likes it/him’ 


b. Peter kauft eine Tasche. Er gefallt ihm. 
Peter buys a bag(F) helikes him 


‘Peter is buying a bag. He likes it/him: 


c. Peter kauft eine Tasche. Er gefallt sich. 
Peter buys a bag(r) he likes himself 


‘Peter is buying a bag. He likes himself? 


In the first example, er ‘he’ can refer to either Peter, the table or something/someone else 
that was previously mentioned in the context. ihm ‘him’ can refer to Peter or someone 
in the context. Reference to the table is restricted by world knowledge. In the second 
example, er ‘he’ cannot refer to Tasche ‘bag’ since Tasche is feminine and er is masculine. 
er ‘he’ can refer to Peter only if ihm ‘him’ does not refer to Peter. ihm would otherwise 
have to refer to a person in the wider context. This is different in (14c). In (14c), er ‘he’ and 
sich ‘himself’ must refer to the same object. This is due to the fact that the reference of 
reflexives such as sich is restricted to a particular local domain. Binding Theory attempts 
to capture these restrictions. 

LF is also important for quantifier scope. Sentences such as (15a) have two readings. 
These are given in (15b) and (15c). 


(15) a. Every man loves a woman. 
b. Vx3y(man(x) > (woman(y) ^ love(x, y))) 
c. dyVx(man(x) > (woman(y) ^ love(x, y))) 
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The symbol V stands for a universal quantifier and 3 stands for an existential quantifier. 
The first formula corresponds to the reading that for every man, there is a woman who 
he loves and in fact, these can be different women. Under the second reading, there is 
exactly one woman such that all men love her. The question of when such an ambiguity 
arises and which reading is possible when depends on the syntactic properties of the 
given utterance. LF is the level which is important for the meaning of determiners such 
as aand every. 

Control Theory is also specified with reference to LF. Control Theory deals with the 
question of how the semantic role of the infinitive subject in sentences such as (16) is 


filled. 


(16) a. Der Professor schlägt dem Studenten vor, die Klausur noch mal zu 
the professor suggests the student PART the test once again to 
schreiben. 
write 


‘The professor advises the student to take the test again’ 


b. Der Professor schlägt dem Studenten vor, die Klausur nicht zu bewerten. 
the professor suggests the student PART the test not to grade 


‘The professor suggests to the student not to grade the test! 


c. Der Professor schlägt dem Studenten vor, gemeinsamins Kino zu gehen. 
the professor suggests the student PART together into cinema to go 


"Ihe professor suggests to the student to go to the cinema together! 


3.1.3.4 The lexicon 


The meaning of words tells us that they have to be combined with certain roles like 
“acting person” or “affected thing” when creating more complex phrases. For example, 
the fact that the verb beat needs two arguments belongs to its semantic contribution. 
The semantic representation of the contribution of the verb beat in (17a) is given in (17b): 


(17) a. Judit beats the grandmaster. 
b. beat'(x,y) 


Dividing heads into valence classes is also referred to as subcategorization: beat is sub- 
categorized for a subject and an object. This term comes from the fact that a head is 
already categorized with regard to its part of speech (verb, noun, adjective, ...) and then 
further subclasses (e.g., intransitive or transitive verb) are formed with regard to valence 
information. Sometimes the phrase X subcategorizes for Y is used, which means X selects 
Y. beat is referred to as the predicate since beat' is the logical predicate. The subject and 
object are the arguments of the predicate. There are several terms used to describe the set 
of selectional requirements such as argument structure, valence frames, subcategorization 
frame, thematic grid and theta-grid or 0-grid. 
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Adjuncts modify semantic predicates and when the semantic aspect is emphasized 
they are also called modifiers. Adjuncts are not present in the argument structure of 
predicates. 

Following GB assumptions, arguments occur in specific positions in the clause - in so- 
called argument positions (e.g., the sister of an X° element, see Section 2.5). The Theta- 
Criterion states that elements in argument positions have to be assigned a semantic role 
- a so-called theta-role - and each role can be assigned only once (Chomsky 1981a: 36): 


Principle 1 (Theta-Criterion) 
e Each theta-role is assigned to exactly one argument position. 


e Every phrase in an argument position receives exactly one theta-role. 


The arguments of a head are ordered, that is, one can differentiate between higher- and 
lower-ranked arguments. The highest-ranked argument of verbs and adjectives has a 
special status. Since GB assumes that it is often (and always in some languages) realized 
in a position outside of the verb or adjective phrase, it is often referred to as the external 
argument. The remaining arguments occur in positions inside of the verb or adjective 
phrase. These kind of arguments are dubbed internal arguments or complements. For 
simple sentences, this often means that the subject is the external argument. 
When discussing types of arguments, one can identify three classes of theta-roles: 


e Class 1: agent (acting individual), the cause of an action or feeling (stimulus), 
holder of a certain property 


e Class 2: experiencer (perceiving individual), the person profiting from something 
(beneficiary) (or the opposite: the person affected by some kind of damage), pos- 
sessor (owner or soon-to-be owner of something, or the opposite: someone who 
has lost or is lacking something) 


e Class 3: patient (affected person or thing), theme 


If a verb has several theta-roles of this kind to assign, Class 1 normally has the highest 
rank, whereas Class 3 has the lowest. Unfortunately, the assignment of semantic roles 
to actual arguments of verbs has received a rather inconsistent treatment in the litera- 
ture. This problem has been discussed by Dowty (1991), who suggests using proto-roles. 
An argument is assigned the proto-agent role if it has sufficiently many of the proper- 
ties that were identified by Dowty as prototypical properties of agents (e.g., animacy, 
volitionality). 

The mental lexicon contains lexical entries with the specific properties of syntactic 
words needed to use that word grammatically. Some of these properties are the follow- 
ing: 


+ form 


* meaning (semantics) 
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e grammatical features: syntactic word class + morphosyntactic features 
e theta-grid 


(18) shows an example of a lexical entry: 


(18) "form helf- ‘help’ 
semantics helfen’ 
grammatical features verb 
theta-grid 
theta-roles agent beneficiary 
grammatical particularities dative 


Assigning semantic roles to specific syntactic requirements (beneficiary = dative) is also 
called linking. 

Arguments are ordered according to their ranking: the highest argument is furthest 
left. In the case of helfen, the highest argument is the external argument, which is why 
the agent is underlined. With so-called unaccusative verbs,’ the highest argument is not 
treated as the external argument. It would therefore not be underlined in the correspond- 
ing lexical entry. 


3.1.4 X theory 


In GB, it is assumed that all syntactic structures licensed by the core grammar‘ corre- 
spond to the X schema (see Section 2.5).° In the following sections, I will comment on the 
syntactic categories assumed and the basic assumptions with regard to the interpretation 
of grammatical rules. 


3.1.4.1 Syntactic categories 


The categories which can be used for the variable X in the X schema are divided into 
lexical and functional categories. This correlates roughly with the difference between 
open and closed word classes. The following are lexical categories: 


e V = verb 
e N = noun 


e A = adjective 


3See Perlmutter (1978) for a discussion of unaccusative verbs. The term ergative verb is also common, albeit 
a misnomer. See Burzio (1981; 1986) for the earliest work on unaccusatives in the Chomskyan framework 
and Grewendorf (1989) for German. Also, see Pullum (1988) on the usage of these terms and for a historical 
evaluation. 

^Chomsky (1981a: 7-8) distinguishes between a regular area of language that is determined by a grammar 
that can be acquired using genetically determined language-specific knowledge and a periphery, to which 
irregular parts of language such as idioms (e.g., to pull the wool over sb.’s eyes) belong. See Section 16.3. 

°Chomsky (1970: 210) allows for grammatical rules that deviate from the X schema. It is, however, common 
practice to assume that languages exclusively use X structures. 
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* P = preposition/postposition 
* Adv = adverb 


s x " ; : e 6 
Lexical categories can be represented using binary features and a cross-classification: 


Table 3.1: Representation of four lexical categories using two binary features 


-V +V 


-N P=[-N,-V] V=[-N,+V] 
+N N=[+N,-V] A=[+N,+V] 


Adverbs are viewed as intransitive prepositions and are therefore captured by the de- 
composition in the table above. 

Using this cross-classification, it is possible to formulate generalizations. One can, for 
example, simply refer to adjectives and verbs: all lexical categories which are [ +V ] are 
either adjectives or verbs. Furthermore, one can say of [ +N ] categories (nouns and 
adjectives) that they can bear case. 


Apart from this, some authors have tried to associate the head position with the fea- 
ture values in Table 3.1 (see e.g., Grewendorf 1988: 52; Haftka 1996: 124; G. Müller 2011: 
238). With prepositions and nouns, the head precedes the complement in German: 


(19) a. fiir Marie 
for Marie 


b. Bild von Maria 
picture of Maria 


With adjectives and verbs, the head is final: 
(20) a. dem König treu 

the king loyal 

‘Loyal to the king’ 


b. der [dem Kind helfende] Mann 
the the child helping man 


‘the man helping the child’ 


c. dem Mann helfen 
the man help 


‘help the man’ 


$See Chomsky (1970: 199) for a cross-classification of N, A and V, and Jackendoff (1977: Section 3.2) for a 
cross-classification that additionally includes P but has a different feature assignment. 
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This data seems to suggest that the head is final with [ +V ] categories and initial with 
[ -V ] categories. Unfortunately, this generalization runs into the problem that there are 
also postpositions in German. These are, like prepositions, not verbal, but do occur after 
the NP they require: 


(21 a. des Geldes wegen 
the money because 


‘because of the money’ 


b. die Nacht über 
the night during 


‘during the night’ 


Therefore, one must either invent a new category, or abandon the attempt to use binary 
category features to describe ordering restrictions. If one were to place postpositions in 
a new category, it would be necessary to assume another binary feature.’ Since this fea- 
ture can have either a negative or a positive value, one would then have four additional 
categories. There are then eight possible feature combinations, some of which would not 
correspond to any plausible category. 

For functional categories, GB does not propose a cross-classification. Usually, the 
following categories are assumed: 


C  Complementizer (subordinating conjunctions such as dass ‘that’) 
I Finiteness (as well as Tense and Mood); 

also Infl in earlier work (inflection), 

T in more recent work (Tense) 
D Determiner (article, demonstrative) 


3.1.4.2 Assumptions and rules 


In GB, it is assumed that all rules must follow the X format discussed in Section 2.5. In 
other theories, rules which correspond to the X format are used along other rules which 
do not. If the strict version of X theory is assumed, this comes with the assumption of 


"Martin Haspelmath has pointed out that one could assume a rule that moves a post-head argument into a 
pre-head position (see Riemsdijk 1978: 89 for the discussion of a transformational solution). This would be 
parallel to the realization of prepositional arguments of adjectives in German: 


(i a. auf seinen Sohn stolz 
on his son proud 
‘proud of his son’ 


b. stolz auf seinen Sohn 
proudof his son 


But note that the situation is different with postpositions here, while all adjectives that take prepositional 
objects allow for both orders, this is not the case for prepositions. Most prepositions do not allow their 
object to occur before them. It is an idiosyncratic feature of some postpositions that they want to have 
their argument to the left. 
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endocentricity: every phrase has a head and every head is part of a phrase (put more 
technically: every head projects to a phrase). 

Furthermore, as with phrase structure grammars, it is assumed that the branches 
of tree structures cannot cross (Non-Tangling Condition). This assumption is made by 
the majority of theories discussed in this book. There are, however, some variants of 
TAG, HPSG, Construction Grammar, and Dependency Grammar which allow crossing 
branches and therefore discontinuous constituents (Becker, Joshi & Rambow 1991; Reape 
1994; Bergen & Chang 2005; Heringer 1996: 261; Eroms 2000: Section 9.6.2). 

In X theory, one normally assumes that there are at most two projection levels (X’ 
and X"). However, there are some versions of Mainstream Generative Grammar and 
other theories which allow three or more levels (Jackendoff 1977; Uszkoreit 1987). In this 
chapter, I follow the standard assumption that there are two projection levels, that is, 
phrases have at least three levels: 


. X = head 


- X’ = intermediate projection (X, read: X bar) 


- XP = highest projection (= X" = X), also called maximal projection 


3.1.5 CP and IP in English 


Most work in Mainstream Generative Grammar is heavily influenced by previous pub- 
lications dealing with English. If one wants to understand GB analyses of German and 
other languages, it is important to first understand the analyses of English and, for this 
reason, this will be the focus of this section. The CP/IP system is also assumed in LFG 
grammars of English and thus the following section also provides a foundation for un- 
derstanding some of the fundamentals of LFG presented in Chapter 7. 

In earlier work, the rules in (22a) and (22b) were proposed for English sentences 
(Chomsky 1981a: 19). 


(22 a. S— NP VP 
b. S NP Infl VP 


Infl stands for Inflection as inflectional affixes are inserted at this position in the structure. 
The symbol AUX was also used instead of Infl in earlier work, since auxiliary verbs are 
treated in the same way as inflectional affixes. Figure 3.3 on the next page shows a 
sample analysis of a sentence with an auxiliary, which uses the rule in (22b). 

Together with its complements, the verb forms a structural unit: the VP. The con- 
stituent status of the VP is supported by several constituent tests and further differences 
between subjects and objects regarding their positional restrictions. 

The rules in (22) do not follow the X template since there is no symbol on the right- 
hand side of the rule with the same category as one on the left-hand side, that is, there is 
no head. In order to integrate rules like (22) into the general theory, Chomsky (1986a: 3) 
developed a rule system with two layers above the verb phrase (VP), namely the CP/IP 
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IP 
B di 
S NP IW 
SE Ee, SE, Tee 
NP INFL VP I VP 
| | 
MA MA 
OS, pp. 
V NP V NP 
eae Lj | uer 
Ann will read the newspaper Ann will read the newspaper 


Figure 3.3: Sentence with an auxiliary verb Figure 3.4: Sentence with auxiliary verb in 
following Chomsky (1981a: 19) the CP/IP system 


system. CP stands for Complementizer Phrase. The head of a CP can be a complementizer. 
Before we look at CPs in more detail, I will discuss an example of an IP in this new 
system. Figure 3.4 shows an IP with an auxiliary in the I? position. As we can see, this 
corresponds to the structure of the X template: I? is a head, which takes the VP as its 
complement and thereby forms I’. The subject is the specifier of the IP. Another way to 
phrase this is to say that the subject is in the specifier position of the IP. This position is 
usually referred to as SpecIP.° 

The sentences in (23) are analyzed as complementizer phrases (CPs), the complemen- 
tizer is the head: 


(23) a that Ann will read the newspaper 
b. that Ann reads the newspaper 


In sentences such as (23), the CPs do not have a specifier. Figure 3.5 on the next page 
shows the analysis of (23a). 

Yes/no-questions in English such as those in (24) are formed by moving the auxiliary 
verb in front of the subject. 


(24) Will Ann read the newspaper? 


Let us assume that the structure of questions corresponds to the structure of sentences 
with complementizers. This means that questions are also CPs. Unlike the sentences in 
(23), however, there is no subordinating conjunction. In the D-structure of questions, 
the C? position is empty and the auxiliary verb is later moved to this position. Figure 3.6 


*Sometimes SpecIP and similar labels are used in trees (for instance by Haegeman (1994), Meinunger (2000) 
and Lohnstein (2014)). I avoid this in this book since SpecIP, SpecAdvP are not categories like NP or AP or 
AdvP but positions that items of a certain category can take. See Chapter 2 on the phrase structure rules 
that license trees. 
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CP CP 
: : 
a Ge 
— Iu 
v v 
pn V NP 
er en = e EE 
that Ann will a the newspaper will, Ann _; read the newspaper 
Figure 3.5: Complementizer phrase Figure 3.6: Polar question 


shows an analysis of (24). The original position of the auxiliary is marked by the trace 
Lj; Which is coindexed with the moved auxiliary. 

wh-questions are formed by the additional movement of a constituent in front of the 
auxiliary; that is into the specifier position of the CP. Figure 3.7 on the facing page shows 
the analysis of (25): 


(25) What will Ann read? 


As before, the movement of the object of read is indicated by a trace. This is important 
when constructing the meaning of the sentence. The verb assigns some semantic role 
to the element in its object position. Therefore, one has to be able to “reconstruct” the 
fact that what actually originates in this position. This is ensured by coindexation of the 
trace with what. 

Several ways to depict traces are used in the literature. Some authors assume a trace 
instead of the object NP as in Figure 3.8a (Grewendorf 1988: 249, 322; Haegeman 1994: 
420). Others have the object NP in the tree and indicate the movement by a trace that is 
dominated by the NP as in Figure 3.8b (von Stechow & Sternefeld 1988: 376; Grewendorf 
1988: 185; Haegeman 1994: 355; Sternefeld 2006: 333). The first proposal directly reflects 
the assumption that a complete phrase is moved and leaves a trace that represents the 
thing that is moved. If one thinks about the properties of the trace it is clear that it has 
the same category as the element that was at this position before movement. Hence the 
second way to represent the moved category is appropriate as well. Figure 3.8b basically 
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CP 
E dicm 
NP, €: 
ger. 
C IP 
"dw 
NP I 
I VP 
| 
VI 
ZU 
V NP 
| | 


what will, Ann A read _; 


Figure 3.7: wh-question 


MA MA MA 
ER es SC 
V ü V NP V NP; 
| | | | | 
read read i read 
(a) Trace (b) XP with empty daughter (c) Mix of a and b 


Figure 3.8: Alternative ways of depicting movement: the moved constituent can be rep- 
resented by a trace or by an XP dominating a trace 


says that the object that is moved is an NP but that there is nothing to pronounce. Given 
what was just said the most appropriate way to represent movement would be the one in 
Figure 3.8c. This picture is a mix of the two other pictures. The index is associated with 
the category and not with the empty phonology. In my opinion this best depicts the fact 
that trace and filler are related. However, I never saw this way of depicting movement 
in the GB literature and hence I will stick to the more common notation in Figure 3.8b. 
This way to depict movement is also more similar to the representation that is used by all 
authors for the movement of words (so-called head-movement). For example the trace 
_%, Which stands for a moved I? in Figure 3.6 is never depicted as daughter of I’ but 
always as a daughter of I’. 
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Until now, Ihave not yet discussed sentences without auxiliaries such as (23b). In order 
to analyze this kind of sentences, it is usually assumed that the inflectional affix is present 
in the I° position. An example analysis is given in Figure 3.9. Since the inflectional affix 


IP 
P 7 Sei, 
NP i 
SE, 
I VP 
| 
V , 
E dac 
V NP 


MAS RUD ac NO 


Ann -s read- the newspaper 
Figure 3.9: Sentence without auxiliary 


precedes the verb, some kind of movement operation still needs to take place. There are 
two suggestions in the literature: one is to assume lowering, that is, the affix moves down 
to the verb (Pollock 1989: 394; Chomsky 1991; Haegeman 1994: 110, 601; Sportiche et al. 
2013). The alternative is to assume that the verb moves up to the affix (Fanselow & Felix 
1987: 258-259). Since theories with lowering of inflectional affixes are complicated for 
languages in which the verb ultimately ends up in C (basically in all Germanic languages 
except English), I follow Fanselow & Felix's (1987: 258-259) suggestion for English and 
Grewendorf's (1993: 1289) suggestion for German and assume that the verb moves from 
V to I in English and from V to I to C in German.” 


?Sportiche, Koopman & Stabler (2013) argue for an affix lowering approach by pointing out that approaches 
assuming that the verb stem moves to I (their T) predict that adverbs appear to the right of the verb rather 
than to the left: 


DI a. John will carefully study Russian. 
b. John carefully studies Russian. 


c. * John studies carefully Russian. 


If the affix -s is in the position of the auxiliary and the verb moves to the affix, one would expect (i.c) to be 
grammatical rather than (i.b). 

A third approach is to assume empty I heads for present and past tense and have these heads select a 
fully inflected verb. See Carnie (2013: 220-221) for such an approach to English. 

For German it was also suggested not to distinguish between I and V at all and treat auxiliaries like 
normal verbs (see footnote 10 below). In such approaches verbs are inflected as V, no I node is assumed 
(Haider 1993; 19972). 
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Following this excursus on the analysis of English sentences, we can now turn to 
German. 


3.1.6 The structure of the German clause 


The CP/IP model has been adopted by many scholars for the analysis of German.” The 
categories C, I and V, together with their specifier positions, can be linked to the topo- 
logical fields as shown in Figure 3.10. 


C’ 


— -—ü c 


IP 
Y 
SE 
XP I 


XP C 


SpecCP IP (without I, V ) VI 
prefield middle field right SB 


SpecIP phrases inside 
subject position | the VP 


Figure 3.10: CP, IP and VP and the topological model of German 


Note that SpecCP and SpecIP are not category symbols. They do not occur in gram- 
mars with rewrite rules. Instead, they simply describe positions in the tree. 

As shown in Figure 3.10, it is assumed that the highest argument of the verb (the sub- 
ject in simple sentences) has a special status. It is taken for granted that the subject 
always occurs outside of the VP, which is why it is referred to as the external argument. 
The VP itself does not have a specifier. In more recent work, however, the subject is 


For GB analyses without IP, see Bayer & Kornfilt (1989), Höhle (1991a: 157), Haider (1993; 19972) and Sterne- 
feld (2006: Section IV.3). Haider assumes that the function of I is integrated into the verb. In LFG, an 
IP is assumed for English (Bresnan 2001: Section 6.2; Dalrymple 2001: Section 3.2.1), but not for German 
(Berman 2003a: Section 3.2.3.2). In HPSG, no IP is assumed. 
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generated in the specifier of the VP (Fukui & Speas 1986; Koopman & Sportiche 1991). 
In some languages, it is assumed that it moves to a position outside of the VP. In other 
languages such as German, this is the case at least under certain conditions (e.g., definite- 
ness, see Diesing 1992). Iam presenting the classical GB analysis here, where the subject 
is outside the VP. All arguments other than the subject are complements of the V, that 
are realized within the VP, that is, they are internal arguments. If the verb requires just 
one complement, then this is the sister of the head V° and the daughter of V’ according 
to the X schema. The accusative object is the prototypical complement. 

Following the X template, adjuncts branch off above the complements of V’. The 
analysis of a VP with an adjunct is shown in Figure 3.11. 


(26) weil der Mann morgen den Jungen trifft 
because the man tomorrow the boy meets 


‘because the man is meeting the boy tomorrow’ 


VP 
| 
V , 
n Ac o 
AdvP v 
pam 
NP V 


morgen den Jungen triff- 


tomorrow theboy meet 


Figure 3.11: Analysis of adjuncts in GB theory 


3.2 Verb position 


In German, the position of the heads of VP and IP (V^ and I°) are to the right of their 
complements and V° and I° form part of the right sentence bracket. The subject and 
all other constituents (complements and adjuncts) all occur to the left of V? and I? and 
form the middle field. It is assumed that German - at least in terms of D-structure - is 
an SOV language (= a language with the base order Subject-Object-Verb). The analysis 
of German as an SOV language is almost as old as Transformational Grammar itself. It 
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was originally proposed by Bierwisch (1963: 34)." Unlike German, Germanic languages 
like Danish, English and Romance languages like French are SVO languages, whereas 
Welsh and Arabic are VSO languages. Around 40 % of all languages belong to the SOV 
languages, around 35 % are SVO (Dryer 2013c). 


The assumption of verb-final order as the base order is motivated by the following 
observations: ” 


1. Verb particles form a close unit with the verb. 


(27 a. weil er morgen an-fängt 
because he tomorrow PART-starts 
‘because he is starting tomorrow’ 
b. Er fängt morgen an. 
he starts tomorrow PART 


‘He is starting tomorrow. 


This unit can only be seen in verb-final structures, which speaks for the fact that 
this structure reflects the base order. 


Verbs which are derived from a noun by back-formation (e.g., uraufführen ‘to per- 
form something for the first time’), can often not be divided into their component 
parts and V2 clauses are therefore ruled out (This was first mentioned by Höhle 
(1991b) in unpublished work. The first published source is Haider (1993: 62)): 


(28) a. weil sie das Stück heute ur-auf-führen 
because they the play today PREF-PART-lead 
‘because they are performing the play for the first time today’ 
b. *Sie ur-auf-führen heute das Stück. 
they PREF-PART-lead today the play 


c. “Sie führen heute das Stück ur-auf. 
they lead today the play PREF-PART 


The examples show that there is only one possible position for this kind of verb. 
This order is the one that is assumed to be the base order. 


"Bierwisch attributes the assumption of an underlying verb-final order to Fourquet (1957). A German trans- 
lation of the French manuscript cited by Bierwisch can be found in Fourquet (1970: 117-135). For other 
proposals, see Bach (1962), Reis (1974), Koster (1975) and Thiersch (1978: Chapter 1). Analyses which as- 
sume that German has an underlying SOV pattern were also suggested in GPSG (Jacobs 1986: 110), LFG 
(Berman 1996: Section 2.1.4) and HPSG (Kiss & Wesche 1991; Oliva 1992; Netter 1992; Kiss 1993; Frank 1994; 
Kiss 1995; Feldhaus 1997; Meurers 2000; Miiller 2005b; 2017a). 

For points 1 and 2, see Bierwisch (1963: 34-36). For point 4 see Netter (1992: Section 2.3). 
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Verbs in non-finite clauses and in finite subordinate clauses with a conjunction are 
always in final position (I am ignoring the possibility of extraposing constituents): 


(29) a. Der Clown versucht, Kurt-Martin die Ware zu geben. 
the clown tries Kurt-Martin the goods to give 


"Ihe clown is trying to give Kurt-Martin the goods: 


b. dass der Clown Kurt-Martin die Ware gibt 
that the clown Kurt-Martin the goods gives 


‘that the clown gives Kurt-Martin the goods’ 


. If one compares the position of the verb in German with Danish (Danish is an SVO 


language like English), then one can clearly see that the verbs in German form a 
cluster at the end of the sentence, whereas they occur before any objects in Danish 


that he him seen have must 


. The scope relations of the adverbs in (31) depend on their order: the left-most ad- 


verb has scope over the two following elements." This was explained by assuming 


his point, it should be mentioned that there seem to be exceptions from the rule that modifiers to the 


2. 
3 
(Orsnes 2009a: 146): 
(30) a. dass er ihn gesehen; haben, muss, 
b. at hanmä, have, set; ham 
thathe must have seen him 
“that he must have seen him’ 
4 
the following structure: 
Bart 
left 


take scope over those to their right. Kasper (1994: 47) discusses examples such as (i), which go back to 


Bartsch & Vennemann (1972: 137). 


(i) 


a. Peter Dest gut wegen der Nachhilfestunden. 
Peter reads well because.of the tutoring 


b. Peter liest wegen der Nachhilfestunden gut. 
Peter reads because.of the tutoring well 


‘Peter can read well thanks to the tutoring’ 


As Koster (1975: Section 6) and Reis (1980: 67) have shown, these are not particularly convincing counter- 
examples as the right sentence bracket is not filled in these examples and therefore the examples are not 
necessarily instances of normal reordering inside of the middle field, but could instead involve extraposi- 
tion of the PP. As noted by Koster and Reis, these examples become ungrammatical if one fills the right 
bracket and does not extrapose the causal adjunct: 


(ii) 


a. “Hans hat gut wegen der Nachhilfestunden gelesen. 
Hans has well because.of the tutoring read 


b. Hans hat gut gelesen wegen der Nachhilfestunden. 
Hans has well read ` because of the tutoring 


‘Hans has been reading well because of the tutoring. 


However, the following example from Crysmann (2004: 383) shows that, even with the right bracket occu- 
pied, one can still have an order where an adjunct to the right has scope over one to the left: 
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(31) a. dasser [absichtlich [nicht lacht]] 
that he intentionally not laughs 


‘that he is intentionally not laughing’ 


b. dass er [nicht [absichtlich lacht]] 
that he not intentionally laughs 


‘that he is not laughing intentionally’ 


It is interesting to note that scope relations are not affected by verb position. If 
one assumes that sentences with verb-second order have the underlying structure 
in (31), then this fact requires no further explanation. (32) shows the derived S- 
structure for (31): 


(32) a. Er lacht; [absichtlich [nicht _;]]. 
he laughs intentionally not 


‘He is intentionally not laughing’ 


b. Er lacht, [nicht [absichtlich _,]]. 


he laughs not intentionally 


‘He is not laughing intentionally’ 


After motivating and briefly sketching the analysis of verb-final order, I will now look 
at the CP/IP analysis of German in more detail. C? corresponds to the left sentence 
bracket and can be filled in two different ways: in subordinate clauses introduced by 
a conjunction, the subordinating conjunction (the complementizer) occupies CT as in 
English. The verb remains in the right sentence bracket, as illustrated by (33). 


(33) dass jeder diesen Mann kennt 
that everybody this man knows 
‘that everybody knows this man’ 
Figure 3.12 on the following page gives an analysis of (33). In verb-first and verb-second 


clauses, the finite verb is moved to C° via the p position: Vos sco (Grewendorf 
1993: 1289). Figure 3.13 on page 107 shows the analysis of (34): 


(34) Kennt jeder diesen Mann? 
knows everybody this man 


"Does everybody know this man?’ 


(iii) Da muß es schon erhebliche Probleme mit der Ausrüstung gegeben haben, da wegen 
there must ExpL already serious problems with the equipment given have since because.of 
schlechten Wetters ein Reinhold Messmer niemals aufgäbe. 
bad weathera Reinhold Messmer never would.give.up 
“There really must have been some serious problems with the equipment because someone like 
Reinhold Messmer would never give up just because of some bad weather. 


Nevertheless, this does not change anything regarding the fact that the corresponding cases in (31) and 
(32) have the same scope relations regardless of the position of the verb. The general means of semantic 
composition may well have to be implemented in the same way as in Crysmann’s analysis. 
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CP 
S 
ee 
C IP 
ea © 
NP r 
pos 
T 
dass jeder diesen Mann kenn-; -t 
that everybody this man know- -s 


Figure 3.12: Sentence with a complementizer in C? 


The CH position is empty in the D-structure of (34). Since it is not occupied by a comple- 
mentizer, the verb can move there. 


3.3 Long-distance dependencies 


The SpecCP position corresponds to the prefield and can be filled by any XP in declarative 
clauses in German. In this way, one can derive the sentences in (36) from (35) by moving 
a constituent in front of the verb: 


(35) Gibt der Manndem Kind jetzt den Mantel? 
gives the.voM man  the.par child now the.Acc coat 


‘Is the man going to give the child the coat now?’ 


(36) a. Der Mann gibt dem Kind jetzt den Mantel. 
the.NoM man gives the.pAT child now the.Acc coat 


"Ihe man is giving the child the coat now! 


b. Dem Kind gibt der Mann jetzt den Mantel. 
the.DAT child gives the.nom man now the.Acc coat 
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CP 
| 
Cc 
m PN 
C IP 
Rd ITE d 
NP I’ 
ER 
VP I 
| 
v’ 
AETS 
NP V 
| 


(kenn-; -t);. jeder diesen Mann .; ., 


knows ` everybody this man 


Figure 3.13: Verb position in GB 


c. Den Mantel gibt der Mann dem Kind jetzt. 
the.acc coat givesthe.nom man  the.par child now 


d. Jetzt gibt der Mann dem ` Kind den Mantel. 
now gives the.nom man  the.par child the.acc coat 


Since any constituent can be placed in front of the finite verb, German is treated typo- 
logically as one of the verb-second languages (V2). Thus, it is a verb-second language 
with SOV base order. English, on the other hand, is an SVO language without the V2 
property, whereas Danish is a V2 language with SVO as its base order (see Orsnes 2009a 
for Danish). 

Figure 3.14 on the following page shows the structure derived from Figure 3.13. The 
crucial factor for deciding which phrase to move is the information structure of the sen- 
tence. That is, material connected to previously mentioned or otherwise-known infor- 
mation is placed further left (preferably in the prefield) and new information tends to 
occur to the right. Fronting to the prefield in declarative clauses is often referred to as 
topicalization. But this is rather a misnomer, since the focus (informally: the constituent 
being asked for) can also occur in the prefield. Furthermore, expletive pronouns can 
occur there and these are non-referential and as such cannot be linked to preceding or 
known information, hence expletives can never be topics. 
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CP 
E we 
NP; C’ 
EE en 
C IP 
Eege 
; 
diesen Mann (kenn-; -t); jeder l Se 
this man know--s everybody 


Figure 3.14: Fronting in GB theory 


Transformation-based analyses also work for so-called long-distance dependencies, that 
is, dependencies crossing several phrase boundaries: 


(37) a. [Um zwei Millionen Mark]; soll er versucht haben, [eine 
around two million Deutsche.Marks should he tried have an 
Versicherung _; Zu betriigen]." 


insurance.company to deceive 
‘He apparently tried to cheat an insurance company out of two million Deu- 
tsche Marks: 
b. ,Wer; glaubt er, daß er _, ist?“ erregte sich ein Politiker vom Nil.” 
who believes he thathe is retort REFLa politician from.the Nile 
* "Who does he think he is?”, a politician from the Nile exclaimed’ 
c. Wen, glaubst du, daß ich _; gesehen habe?! 
who believe you that I seen have 


‘Who do you think I saw?’ 


V ta; 04.05.2001, p. 20. 
P Spiegel, 8/1999, p. 18. 
16 Scherpenisse (1986: 84). 
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d. [Gegen ihn], falle es den Republikanern hingegen schwerer, 
against him fall it the Republicans however more.difficult 
[ [ Angriffe ;] zu lancieren]." 
attacks to launch 


‘It is, however, more difficult for the Republicans to launch attacks against 


him! 

The elements in the prefield in the examples in (37) all originate from more deeply em- 
bedded phrases. In GB, it is assumed that long-distance dependencies across sentence 
boundaries are derived in steps (Grewendorf 1988: 75-79), that is, in the analysis of 
(37c), the interrogative pronoun is moved to the specifier position ofthe dass-clause and 
is moved from there to the specifier ofthe matrix clause. The reason for this is that there 
are certain restrictions on movement which must be checked locally. 


3.4 Passive 


Before I turn to the analysis of the passive in Section 3.4.2, the first subsection will 
elaborate on the differences between structural and lexical case. 


3.4.1 Structural and lexical case 


The case of many case-marked arguments is dependent on the syntactic environment in 
which the head ofthe argument is realized. These arguments are referred to as arguments 
with structural case. Case-marked arguments, which do not bear structural case, are said 
to have lexical case.” 


The following are examples of structural case: 


(38) a. Der Installateur kommt. 
De wou plumber comes 


"Ihe plumber is coming: 


b. Der Mann lasst den Installateur kommen. 
the man lets the.acc plumber come 


"Ihe man is getting the plumber to come: 


taz, 08.02.2008, p. 9. 

Furthermore, there is a so-called agreeing case (see page 41) and semantic case. Agreeing case is found 
in predicatives. This case also changes depending on the structure involved, but the change is due to 
the antecedent element changing its case. Semantic case depends on the function of certain phrases (e.g., 
temporal accusative adverbials). Furthermore, as with lexical case of objects, semantic case does not change 
depending on the syntactic environment. For the analysis of the passive, which will be discussed in this 
section, only structural and lexical case will be relevant. 

P Compare Heinz & Matiasek (1994: 200). 

(38b) is a so-called AcI construction. AcI stands for Accusativus cum infinitivo, which means "accusative 
with infinitive”. The logical subject of the embedded verb (kommen ‘to come’ in this case) becomes the 
accusative object of the matrix verb lassen ‘to let’. Examples for AcI-verbs are perception verbs such as 
hóren ‘to hear’ and sehen ‘to see’ as well as lassen ‘to let’. 
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c. das Kommen des Installateurs 
the coming of.the plumber 


“the plumber’s visit’ 
In the first example, the subject is in the nominative case, whereas Installateur ‘plumber’ 
is in accusative in the second example and even in the genitive in the third following 


nominalization. The accusative case of objects is normally structural case. This case 
becomes nominative under passivization: 


(39) a. Karl schlägt den Weltmeister. 
Karl beats the.acc world.champion 
‘Karl beats the world champion, 
b. Der Weltmeister wird geschlagen. 
the.NoM world champion is ^ beaten 
‘The world champion is being beaten’ 


Unlike the accusative, the genitive governed by a verb is a lexical case. The case of a 
genitive object does not change when the verb is passivized. 


(40) a. Wir gedenken der Opfer. 
we remember the.GEN victims 
b. Der Opfer wird gedacht. 
the.GEN victims are remembered 
"Ihe victims are being remembered: 
(40b) is an example of the so-called impersonal passive. Unlike example (39b), where the 


accusative object became the subject, there is no subject in (40b). See Section 1.7.1. 
Similarly, there is no change in case with dative objects: 


(41) a. Der Mann hat ihm geholfen. 

the man has bm poar helped 

"Ihe man has helped him: 

b. Ihm wird geholfen. 

him.paT is helped 

‘He is being helped: 
It still remains controversial as to whether all datives should be treated as lexical or 
whether some or all of the datives in verbal environments should be treated as instances 
of structural case. For reasons of space, I will not recount this discussion but instead 


refer the interested reader to Chapter 14 of Müller (2007a). In what follows, I assume - 
like Haider (1986a: 20) - that the dative is in fact a lexical case. 


3.4.2 Case assignment and the Case Filter 


In GB, it is assumed that the subject receives case from (finite) I and that the case of the 
remaining arguments comes from V (Chomsky 1981a: 50; Haider 1984: 26; Fanselow & 
Felix 1987: 71-73). 
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Principle 2 (Case Principle) 
e V assigns objective case (accusative) to its complement if it bears structural case. 


e When finite, INFL assigns case to the subject. 


The Case Filter rules out structures where case has not been assigned to an NP. 
Figure 3.15 shows the Case Principle in action with the example in (42a).”° 


(42) a. [dass] der Mann der Frau den Jungen zeigt 
that the man the.nAr woman the.Acc boy shows 
‘that the man shows the boy to the woman’ 


b. [dass] der Junge der Frau gezeigt wird 
that the boy.NoM the.DAT woman shown is 


‘that the boy is shown to the woman’ 


| Ban gui ope ees just case 
— ——- just theta-role 
Tue case and theta-role 


der Mann derFrau den Jungen zeig- -t 


the man thewoman theboy show- -s 
Figure 3.15: Case and theta-role assignment in active clauses 


The passive morphology blocks the subject and absorbs the structural accusative. The 
object that would get accusative in the active receives only a semantic role in its base 


20 The figure does not correspond to X theory in its classic form, since der Frau ‘the woman’ is a complement 
which is combined with V’. In classical X theory, all complements have to be combined with V°. This 
leads to a problem in ditransitive structures since the structures have to be binary (see Larson (1988) for a 
treatment of double object constructions). Furthermore, in the following figures the verb has been left in 
V? for reasons of clarity. In order to create a well-formed S-structure, the verb would have to move to its 
affix in I. 
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IP 
NP jo we 
VP I 
Ed 
you REN : 
NP VN 
NP V 
| just case 
; : — —— just theta-role 
der Junge; derFrau _; gezeigtwir- -d | ..... casé and theta-tole 
the boy the woman shown is 


Figure 3.16: Case and theta-role assignment in passive clauses 


position in the passive, but it does not get the absorbed case. Therefore, it has to move 
to a position where case can be assigned to it (Chomsky 1981a: 124). Figure 3.16 shows 
how this works for example (42b). This movement-based analysis works well for English 
since the underlying object always has to move: 


(43) a. The mother gave [the girl] [a cookie]. 
b. [Ihe girl] was given [a cookie] (by the mother). 


c. *It was given [the girl] [a cookie]. 


(43c) shows that filling the subject position with an expletive is not possible, so the object 
really has to move. However, Lenerz (1977: Section 4.4.3) showed that such a movement 
is not obligatory in German: 


(44) a. weil das Mädchen dem Jungen den Ball schenkte 
because the.nom girl the.DAT boy — the.Acc ball gave 


‘because the girl gave the ball to the boy’ 


b. weil dem Jungen der Ball geschenkt wurde 
because he par boy the.nom ball given was 


‘because the ball was given to the boy’ 


c. weil der Balldem Jungen geschenkt wurde 
because the.nom ball the.pAr boy given was 
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In comparison to (44c), (44b) is the unmarked order. der Ball ‘the ball’ in (44b) occurs in 
the same position as den Ball in (44a), that is, no movement is necessary. Only the case 
differs. (44c) is, however, somewhat marked in comparison to (44b). So, if one assumed 
(44c) to be the normal order for passives and (44b) is derived from this by movement 
of dem Jungen ‘the boy’, (44b) should be more marked than (44c), contrary to the facts. 
To solve this problem, an analysis involving abstract movement has been proposed for 
cases such as (44b): the elements stay in their positions, but are connected to the subject 
position and receive their case information from there. Grewendorf (1993: 1311) assumes 
that there is an empty expletive pronoun in the subject position of sentences such as 
(44b) as well as in the subject position of sentences with an impersonal passive such as 
a 


(45) weil heute nicht gearbeitet wird 
because today not worked is 


‘because there will be no work done today’ 


A silent expletive pronoun is something that one cannot see or hear and that does not 
carry any meaning. For discussion of this kind of empty element, see Section 13.1.3 and 
Chapter 19. 

In the following chapters, I describe alternative treatments of the passive that do with- 
out mechanisms such as empty elements that are connected to argument positions and 
that seek to describe the passive in a more general, cross-linguistically consistent man- 
ner as the suppression of the most prominent argument. 

A further question which needs to be answered is why the accusative object does not 
receive case from the verb. This is captured by a constraint, which goes back to Burzio 
(1986: 178-185) and is therefore referred to as Burzio’s Generalization.” 


(46) Burzio’s Generalization (modified): 
If V does not have an external argument, then it does not assign (structural) ac- 
cusative case. 


?! See Koster (1986: 11-12) for a parallel analysis for Dutch as well as Lohnstein (2014) for a movement-based 
account of the passive that also involves an empty expletive for the analysis of the impersonal passive. 

22 Burzio’s original formulation was equivalent to the following: a verb assigns accusative if and only if it 
assigns a semantic role to its subject. This claim is problematic from both sides. In (i), the verb does not 
assign a semantic role to the subject; however there is nevertheless accusative case: 


(i) Mich friert. 
me.Acc freezes 
‘I am freezing: 
One therefore has to differentiate between structural and lexical accusative and modify Burzio’s General- 


ization accordingly. The existence of verbs like begegnen ‘to bump into’ is problematic for the other side 
of the implication. begegnen has a subject but still does not assign accusative but rather dative: 


(ii) Peter begegnete einem Mann. 
Peter met a.DAT man 


‘Peter met a man: 


See Haider (1999) and Webelhuth (1995: 89) as well as the references cited there for further problems 
with Burzio’s Generalization. 
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Koster (1986: 12) has pointed out that the passive in English cannot be derived by Case 
Theory since if one allowed empty expletive subjects for English as well as German and 
Dutch, then it would be possible to have analyses such as the following in (47) where np 
is an empty expletive: 


(47) np was read the book. 


Koster rather assumes that subjects in English are either bound by other elements (that is, 
non-expletive) or lexically filled, that is, filled by visible material. Therefore, the structure 
in (47) would be ruled out and it would be ensured that the book would have to be placed 
in front of the finite verb so that the subject position is filled. 


3.5 Local reordering 


Arguments in the middle field can, in principle, occur in an almost arbitrary order. (48) 
exemplifies this: 


(48) a. [weil] der Mann der Frau das Buch gibt 
because the man the woman the book gives 


‘because the man gives the book to the woman’ 


b. [weil] der Mann das Buch der Frau gibt 
because the man the book the woman gives 


c. [weil] das Buch der Mann der Frau gibt 
because the book the man the woman gives 


d. [weil] das Buch der Frau der Mann gibt 
because the book the woman the man gives 


e. [weil] derFrau der Mann das Buch gibt 
because the woman the man the book gives 


f. [weil] derFrau das Buch der Mann gibt 
because the woman the book the man gives 


In (48b-f), the constituents receive different stress and the number of contexts in which 
each sentence can be uttered is more restricted than in (48a) (Höhle 1982). The order in 
(48a) is therefore referred to as the neutral order or unmarked order. 

Two proposals have been made for analyzing these orders: the first suggestion as- 
sumes that the five orderings in (48b-f) are derived from a single underlying order by 
means of Move-a (Frey 1993). As an example, the analysis of (48c) is given in Figure 3.17 
on the next page. The object das Buch ‘the book’ is moved to the left and adjoined to the 
topmost IP. 

An argument that has often been used to support this analysis is the fact that scope 
ambiguities exist in sentences with reorderings which are not present in sentences in 
the base order. The explanation of such ambiguities comes from the assumption that 
the scope of quantifiers can be derived from their position in the surface structure as 
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IP 
See EE, 
NP[acc] ; IP 
were Se 
NP[nom] D 
LO OS 
VP I 


, 


V 
EXE 


NP[dat] VI 
NP V 
das Buch der Mann derFrau _; gib- -t 
thebook theman the woman give- -s 


Figure 3.17: Analysis of local reordering as adjunction to IP 


well as their position in the deep structure. If the position in both the surface and deep 
structure are the same, that is, when there has not been any movement, then there is 
only one reading possible. If movement has taken place, however, then there are two 
possible readings (Frey 1993: 185): 


(49) a. Esist nicht der Fall, daß er mindestens einem Verleger fast jedes Gedicht 
it is not the case that he at.least one publisher almost every poem 
anbot. 


offered 


‘It is not the case that he offered at least one publisher almost every poem. 

b. Esist nicht der Fall, daß er fast jedes Gedicht; mindestens einem Verleger 
it is not thecasethathealmosteverypoem atleast one publisher 
_; anbot. 


offered 


‘It is not the case that he offered almost every poem to at least one publisher’ 
It turns out that approaches assuming traces run into problems as they predict certain 
readings for sentences with multiple traces which do not exist (see Kiss 2001: 146 and 


Fanselow 2001: Section 2.6). For instance in an example such as (50), it should be possible 
to interpret mindestens einem Verleger ‘at least one publisher’ at the position of. ;, which 
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would lead to a reading where fast jedes Gedicht ‘almost every poem’ has scope over 
mindestens einem Verleger ‘at least one publisher’. However, this reading does not exist. 


(50) Ich glaube, dass mindestens einem Verleger; fast jedes Gedicht, nur dieser 
I believe that at.least one publisher almost every poem only this 
Dichter _; = angeboten hat. 
poet offered has 


‘I think that only this poet offered almost every poem to at least one publisher! 


Sauerland & Elbourne (2002: 308) discuss analogous examples from Japanese, which 
they credit to Kazuko Yatsushiro. They develop an analysis where the first step is to 
move the accusative object in front of the subject. Then, the dative object is placed in 
front of that and then, in a third movement, the accusative is moved once more. The 
last movement can take place to construct either the S-structure? or as a movement to 
construct the Phonological Form. In the latter case, this movement will not have any 
semantic effects. While this analysis can predict the correct available readings, it does 
require a number of additional movement operations with intermediate steps. 

The alternative to a movement analysis is so-called base generation: the starting struc- 
ture generated by phrase structure rules is referred to as the base. One variant of base 
generation assumes that the verb is combined with one argument at a time and each 0- 
role is assigned in the respective head-argument configuration. The order in which argu- 
ments are combined with the verb is not specified, which means that all of the orders in 
(48) can be generated directly without any transformations."^ Fanselow (2001) suggested 
such an analysis within the framework of GB.” Note that such a base-generation analy- 
sis is incompatible with an IP approach that assumes that the subject is realized in the 
specifier of IP. An IP approach with base-generation of different argument orders would 
allow the complements to appear in any order within the VP but the subject would be 
first since it is part of a different phrase. So the orders in (51a,b) could be analyzed, but 
the ones in (51c-f) could not: 


(51) a. dass der Mann der Frau ein Buch gibt 
that the.nom man the.DAT woman a.Acc book gives 


b. dass der Mannein Buch der Frau gibt 
that the.nom man  a.Acc book the.DAT woman gives 


The authors are working in the Minimalist framework. This means there is no longer S-structure strictly 
speaking. I have simply translated the analysis into the terms used here. 

? Compare this to the grammar in (6) on page 55. This grammar combines a V and an NP to form a new V. 
Since nothing is said about the case of the argument in the phrase structure rule, the NPs can be combined 
with the verb in any order. 

5 The base generation analysis is the natural analysis in the HPSG framework. It has already been developed 
by Gunji in 1986 for Japanese and will be discussed in more detail in Section 9.4. Sauerland & Elbourne 
(2002: 313-314) claim that they show that syntax has to be derivational, that is, a sequence of syntactic 
trees has to be derived. I am of the opinion that this cannot generally be shown to be the case. There 
is, for example, an analysis by Kiss (2001) which shows that scope phenomena can be explained well by 
constraint-based approaches. 
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c. dass der Frau der Mann ein Buch gibt 
that the.pAT woman the.NoM man  a.Acc book gives 


d. dass der Frau ein Buch der Mann gibt 
that the.pAT woman a.Acc book the.nom man gives 


e. dassein Buch der Frau der Mann gibt 
that a.Acc book the.DAT woman the.NoM man gives 


f. dassein Buch der Mann der Frau gibt 
that a.acc book the. NoM man the.DAT woman gives 


For the discussion of different approaches to describing constituent position, see Fan- 
selow (1993). 


3.6 Summary and classification 


Works in GB and some contributions to the Minimalist Program (see Chapter 4) have led 
to a number of new discoveries in both language-specific and cross-linguistic research. 
In the following, I will focus on some aspects of German syntax. 

The analysis of verb movement developed in Transformational Grammar by Bierwisch 
(1963: 34), Reis (1974), Koster (1975), Thiersch (1978: Chapter 1) and den Besten (1983) has 
become the standard analysis in almost all grammar models (possibly with the exception 
of Construction Grammar and Dependency Grammar). 

The work by Lenerz (1977) on constituent order has influenced analyses in other frame- 
works (the linearization rules in GPSG and HPSG go back to Lenerz' descriptions). Hai- 
der's work on constituent order, case and passive (1984; 1985b; 1985a; 1986a; 1990b; 1993) 
has had a significant influence on LFG and HPSG analyses of German. 

The entire configurationality discussion, that is, whether it is better to assume that the 
subject of finite verbs in German is inside or outside the VP, was important (for instance 
Haider 1982; Grewendorf 1983; Kratzer 1984; 1996; Webelhuth 1985; Sternefeld 1985b; 
Scherpenisse 1986; Fanselow 1987; Grewendorf 1988; Dürscheid 1989; Webelhuth 1990; 
Oppenrieder 1991; Wilder 1991; Haider 1993; Grewendorf 1993; Frey 1993; Lenerz 1994; 
Meinunger 2000) and German unaccusative verbs received their first detailed discussion 
in GB circles (Grewendorf 1989; Fanselow 19922). The works by Fanselow and Frey on 
constituent order, in particular with regard to information structure, have advanced Ger- 
man syntax quite considerably (Fanselow 1988; 1990; 1993; 2000a; 2001; 2003b,c; 2004a; 
Frey 2000; 2001; 2004b; 2005). Infinitive constructions, complex predicates and partial 
fronting have also received detailed and successful treatments in the GB/MP frameworks 
(Bierwisch 1963; Evers 1975; Haider 1982; 1986b; 1990a; 1991; 1993; Grewendorf 1983; 1987; 
1988; den Besten 1985; Sternefeld 1985b; Fanselow 1987; 2002; von Stechow & Sternefeld 
1988; Bayer & Kornfilt 1989; G. Müller 1996a; 1998; Vogel & Steinbach 1998). In the area 
of secondary predication, the work by Winkler (1997) is particularly noteworthy. 

This list of works from subdisciplines of grammar is somewhat arbitrary (it corre- 
sponds more or less to my own research interests) and is very much focused on German. 
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There are, of course, a wealth of other articles on other languages and phenomena, which 
should be recognized without having to be individually listed here. 

In the remainder of this section, I will critically discuss two points: the model of lan- 
guage acquisition of the Principles & Parameters framework and the degree of formal- 
ization inside Chomskyan linguistics (in particular the last few decades and the conse- 
quences this has). Some of these points will be mentioned again in Part II. 


3.6.1 Explaining language acquisition 


One of the aims of Chomskyan research on grammar is to explain language acquisition. 
In GB, one assumed a very simple set of rules, which was the same for all languages 
(X theory), as well as general principles that hold for all languages, but which could be 
parametrized for individual languages or language classes. It was assumed that a pa- 
rameter was relevant for multiple phenomena. The Principles & Parameters model was 
particularly fruitful and led to a number of interesting studies in which commonalities 
and differences between languages were uncovered. From the point of view of language 
acquisition, the idea of a parameter which is set according to the input has often been 
cricitized as it cannot be reconciled with observable facts: after setting a parameter, a 
learner should have immediately mastered certain aspects of that language. Chomsky 
(1986b: 146) uses the metaphor of switches which can be flipped one way or the other. 
As it is assumed that various areas of grammar are affected by parameters, setting one 
parameter should have a significant effect on the rest of the grammar of a given learner. 
However, the linguistic behavior of children does not change in an abrupt fashion as 
would be expected (Bloom 1993: 731; Haider 1993: 6; Abney 1996: 3; Ackerman & Webel- 
huth 1998: Section 9.1; Tomasello 2000; 2003). Furthermore, it has not been possible to 
prove that there is a correlation between a certain parameter and various grammatical 
phenomena. For more on this, see Chapter 16. 

The Principles & Parameters model nevertheless remains interesting for cross-linguis- 
tic research. Every theory has to explain why the verb precedes its objects in English and 
follows them in Japanese. One can name this difference a parameter and then classify 
languages accordingly, but whether this is actually relevant for language acquisition is 
being increasingly called in question. 


3.6.2 Formalization 


In his 1963 work on Transformational Grammar, Bierwisch writes the following:”° 


It is very possible that the rules that we formulated generate sentences which are 
outside of the set of grammatical sentences in an unpredictable way, that is, they 


?6Es ist also sehr wohl möglich, daß mit den formulierten Regeln Sätze erzeugt werden können, die auch 
in einer nicht vorausgesehenen Weise aus der Menge der grammatisch richtigen Sätze herausfallen, die 
also durch Eigenschaften gegen die Grammatikalität verstoßen, die wir nicht wissentlich aus der Unter- 
suchung ausgeschlossen haben. Das ist der Sinn der Feststellung, daß eine Grammatik eine Hypothese 
über die Struktur einer Sprache ist. Eine systematische Überprüfung der Implikationen einer für natürliche 
Sprachen angemessenen Grammatik ist sicherlich eine mit Hand nicht mehr zu bewältigende Aufgabe. Sie 
könnte vorgenommen werden, indem die Grammatik als Rechenprogramm in einem Elektronenrechner 
realisiert wird, so daß überprüft werden kann, in welchem Maße das Resultat von der zu beschreibenden 
Sprache abweicht. 
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violate grammaticality due to properties that we did not deliberately exclude in 
our examination. This is meant by the statement that a grammar is a hypothesis 
about the structure of a language. A systematic check of the implications of a 
grammar that is appropriate for natural languages is surely a task that cannot be 
done by hand any more. This task could be solved by implementing the grammar 
as a calculating task on a computer so that it becomes possible to verify to which 
degree the result deviates from the language to be described. (Bierwisch 1963: 163) 


Bierwisch’s claim is even more valid in light of the empirical progress made in the last 
decades. For example, Ross (1967) identified restrictions for movement and long-distance 
dependencies and Perlmutter (1978) discovered unaccusative verbs in the 70s. For Ger- 
man, see Grewendorf (1989) and Fanselow (1992a). Apart from analyses of these phenom- 
ena, restrictions on possible constituent positions have been developed (Lenerz 1977), as 
well as analyses of case assignment (Yip, Maling & Jackendoff 1987; Meurers 1999c; Prze- 
piörkowski 1999b) and theories of verbal complexes and the fronting of parts of phrases 
(Evers 1975; Grewendorf 1988; Hinrichs & Nakazawa 1994; Kiss 1995; G. Müller 1998; 
Meurers 1999b; Müller 1999b; 2002a; De Kuthy 2002). All these phenomena interact! 
Consider another quote: 


A goal of earlier linguistic work, and one that is still a central goal of the linguistic 
work that goes on in computational linguistics, is to develop grammars that assign 
a reasonable syntactic structure to every sentence of English, or as nearly every 
sentence as possible. This is not a goal that is currently much in fashion in theoret- 
ical linguistics. Especially in Government-Binding theory (GB), the development 
of large fragments has long since been abandoned in favor of the pursuit of deep 
principles of grammar. ‘The scope of the problem of identifying the correct parse 
cannot be appreciated by examining behavior on small fragments, however deeply 
analyzed. Large fragments are not just small fragments several times over - there 
is a qualitative change when one begins studying large fragments. As the range of 
constructions that the grammar accommodates increases, the number of undesired 
parses for sentences increases dramatically. (Abney 1996: 20) 


So, as Bierwisch and Abney point out, developing a sound theory of a large fragment of a 
human language is a really demanding task. But what we aim for as theoretical linguists 
is much more: the aim is to formulate restrictions which ideally hold for all languages or 
at least for certain language classes. It follows from this, that one has to have an overview 
of the interaction of various phenomena in not just one but several languages. This task 
is so complex that individual researchers cannot manage it. This is the point at which 
computer implementations become helpful as they immediately flag inconsistencies in 
a theory. After removing these inconsistencies, computer implementations can be used 
to systematically analyze test data or corpora and thereby check the empirical adequacy 
of the theory (Müller, 1999b: Chapter 22; 2015b; 2014d; Oepen & Flickinger 1998; Bender 
2008b, see Section 1.2). 

More than 60 years after the first important published work by Chomsky, it is appar- 
ent that there has not been one large-scale implemented grammatical fragment on the 
basis of Transformational Grammar analyses. Chomsky has certainly contributed to the 
formalization of linguistics and developed important formal foundations which are still 
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relevant in the theory of formal languages in computer science and in theoretical com- 
putational linguistics (Chomsky 1959). However, in 1981, he had already turned his back 
on rigid formalization: 


I think that we are, in fact, beginning to approach a grasp of certain basic princi- 
ples of grammar at what may be the appropriate level of abstraction. At the same 
time, it is necessary to investigate them and determine their empirical adequacy 
by developing quite specific mechanisms. We should, then, try to distinguish as 
clearly as we can between discussion that bears on leading ideas and discussion 
that bears on the choice of specific realizations of them. (Chomsky 1981a: 2-3) 


This is made explicit in a letter to Natural Language and Linguistic Theory: 


Even in mathematics, the concept of formalization in our sense was not devel- 
oped until a century ago, when it became important for advancing research and 
understanding. I know of no reason to suppose that linguistics is so much more 
advanced than 19th century mathematics or contemporary molecular biology that 
pursuit of Pullum’s injunction would be helpful, but if that can be shown, fine. 
For the present, there is lively interchange and exciting progress without any sign, 
to my knowledge, of problems related to the level of formality of ongoing work. 
(Chomsky 1990: 146) 


This departure from rigid formalization has led to there being a large number of publi- 
cations inside Mainstream Generative Grammar with sometimes incompatible assump- 
tions to the point where it is no longer clear how one can combine the insights of the 
various publications. An example of this is the fact that the central notion of government 
has several different definitions (see Aoun & Sportiche 1983 for an overview). 

This situation has been cricitized repeatedly since the 80s and sometimes very harshly 
by proponents of GPSG (Gazdar, Klein, Pullum & Sag 1985: 6; Pullum 1985; 1989a; Pullum 
1991: 48; Kornai & Pullum 1990). 

The lack of precision and working out of the details?? and the frequent modification 
of basic assumptions” has led to insights gained by Mainstream Generative Grammar 
rarely being translated into computer implementations. There are some implementations 
that are based on Transformational Grammar/GB/MP models or borrow ideas from Main- 
stream Generative Grammar (Petrick 1965; Zwicky, Friedman, Hall & Walker 1965; Kay 
1967; Friedman 1969; Friedman, Bredt, Doran, Pollack & Martner 1971; Plath 1973; Morin 
1973; Marcus 1980; Abney & Cole 1986; Kuhns 1986; Correa 1987; Stabler 1987; 1992; 2001; 
Kolb & Thiersch 1991; Fong 1991; Crocker & Lewin 1992; Lohnstein 1993; Lin 1993; Ford- 
ham & Crocker 1994; Nordgard 1994; Veenstra 1998; Fong & Ginsburg 2012),°° but these 


27 A further definition can be found in Aoun & Lightfoot (1984). This is, however, equivalent to an earlier 
version as shown by Postal & Pullum (1986: 104-106). 

8 See e.g., Kuhns (1986: 550), Crocker & Lewin (1992: 508), Kolb & Thiersch (1991: 262), Kolb (1997: 3) and 
Freidin (1997: 580), Veenstra (1998: 25, 47), Lappin et al. (2000a: 888) and Stabler (2011a: 397, 399, 400) for 
the latter. 

2° See e.g., Kolb (1997: 4), Fanselow (2009) and the quote from Stabler on page 177. 

3°See Fordham & Crocker (1994) for a combination of a GB approach with statistical methods. 
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implementations often do not use transformations or differ greatly from the theoretical 
assumptions of the publications. For example, Marcus (1980: 102-104) and Stabler (1987: 
5) use special purpose rules for auxiliary inversion.” These rules reverse the order of 
John and has for the analysis of sentences such as (52a) so that we get the order in (52b), 
which is then parsed with the rules for non-inverted structures. 


(52) a. Has John scheduled the meeting for Wednesday? 
b. John has scheduled the meeting for Wednesday? 


These rules for auxiliary inversion are very specific and explicitly reference the category 
of the auxiliary. This does not correspond to the analyses proposed in GB in any way. 
As we have seen in Section 3.1.5, there are no special transformational rules for auxiliary 
inversion. Auxiliary inversion is carried out by the more general transformation Move-a 
and the associated restrictive principles. It is not unproblematic that the explicit formu- 
lation of the rule refers to the category auxiliary as is clear when one views Stabler’s 
GB-inspired phrase structure grammar: 


(53) a. s— switch(aux_verb,np), vp. 
b. s([First|L0],L,X0,X) :- aux_verb(First), 
np(L0,L1,X0,X1), 
vp([First|L1],L,X1,X). 


The rule in (53a) is translated into the Prolog predicate in (53b). The expression [First|L0] 
after the s corresponds to the string, which is to be processed. The '|'-operator divides 
the list into a beginning and a rest. First is the first word to be processed and LO contains 
all other words. In the analysis of (52a), First is has and LO is John scheduled the meeting 
for Wednesday. In the Prolog clause, it is then checked whether First is an auxiliary 
(aux. verb(First)) and if this is the case, then it will be tried to prove that the list LO 
begins with a noun phrase. Since John is an NP, this is successful. L1 is the sublist of 
L0 which remains after the analysis of L0, that is scheduled the meeting for Wednesday. 
This list is then combined with the auxiliary (First) and now it will be checked whether 
the resulting list has scheduled the meeting for Wednesday begins with a VP. This is the 
case and the remaining list L is empty. As a result, the sentence has been successfully 
processed. 

The problem with this analysis is that exactly one word is checked in the lexicon. 
Sentences such as (54) can not be analyzed: 


(54) Couldor should we pool our capital with that of other co-ops to address the needs 
of a regional *neighborhood"??? 


? Nozohoor-Farshi (1986; 1987) has shown that Marcus' parser can only parse context-free languages. Since 
natural languages are of a greater complexity (see Chapter 17) and grammars of corresponding complexity 
are allowed by current versions of Transformational Grammar, Marcus’ parser can be neither an adequate 
implementation of the Chomskyan theory in question nor a piece of software for analyzing natural lan- 
guage in general. 

32For a discussion that shows that the coordination of lexical elements has to be an option in linguistic 
theories, see Abeillé (2006). 

P http://www.cooperativegrocer.coop/articles/index.php?id=595. 2010-03-28. 
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In this kind of sentence, two modal verbs have been coordinated. They then form an x? 
and - following GB analyses - can be moved together. If one wanted to treat these cases 
as Stabler does for the simplest case, then we would need to divide the list of words 
to be processed into two unlimited sub-lists and check whether the first list contains 
an auxiliary or several coordinated auxiliaries. We would require a recursive predicate 
aux_verbs which somehow checks whether the sequence could or should is a well-formed 
sequence of auxiliaries. This should not be done by a special predicate but rather by 
syntactic rules responsible for the coordination of auxiliaries. The alternative to a rule 
such as (53a) would be the one in (55), which is the one that is used in theories like GPSG 
(Gazdar et al. 1985: 62), LFG (Falk 1984: 491), some HPSG analyses (Ginzburg & Sag 2000: 
36), and Construction Grammar (Fillmore 1999): 


(55) s v(aux+), np, vp. 


This rule would have no problems with coordination data like (54) as coordination of 
multiple auxiliaries would produce an object with the category v(aux+) (for more on 
coordination see Section 21.6.2). If inversion makes it necessary to stipulate a special 
rule like (53a), then it is not clear why one could not simply use the transformation-less 
rule in (55). 

In the MITRE system (Zwicky et al. 1965), there was a special grammar for the surface 
structure, from which the deep structure was derived via reverse application of trans- 
formations, that is, instead of using one grammar to create deep structures which are 
then transformed into other structures, one required two grammars. The deep structures 
that were determined by the parser were used as input to a transformational component 
since this was the only way to ensure that the surface structures can actually be derived 
from the base structure (Kay 2011: 10). 

The REQUEST system by Plath (1973) also used a surface grammar and inverse trans- 
formations to arrive at the deep structure, which was used for semantic interpretation. 

There are other implementations discussed in this chapter that differ from transfor- 
mation-based analyses. For example, Kolb & Thiersch (1991: 265, Section 4) arrive at 
the conclusion that a declarative, constraint-based approach to GB is more appropri- 
ate than a derivational one. Johnson (1989) suggests a Parsing as Deduction approach 
which reformulates sub-theories of GB (X theory, Theta-Theory, Case Theory, ...) as log- 
ical expressions.?^ These can be used independently of each other in a logical proof. In 
Johnson’s analysis, GB theory is understood as a constraint-based system. More general 
restrictions are extracted from the restrictions on S- and D-structure which can then be 
used directly for parsing. This means that transformations are not directly carried out 
by the parser. As noted by Johnson, the language fragment he models is very small. It 
contains no description of wh-movement, for example (p. 114). 

Lin (1993) implemented the parser PrinciParse. It is written in C++ and based on GB 
and Barriers - the theoretical stage after GB (see Chomsky 1986a). The system contains 
constraints like the Case Filter, the Theta-Criterion, Subjacency, the Empty Category 


34See Crocker & Lewin (1992: 511) and Fordham & Crocker (1994: 38) for another constraint-based Parsing- 
as-Deduction approach. 
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Principle and so on. The Theta-Criterion is implemented with binary features +/-theta, 
there is no implementation of Logical Form (p. 119). The system organizes the grammar 
in a network that makes use of the object-oriented organization of C++ programs, that 
is, default-inheritance is used to represent constraints in super and subclasses (Lin 1993: 
Section 5). This concept of inheritance is alien to GB theory: it does not play any role in 
the main publications. The grammar networks license structures corresponding to X the- 
ory, but they code the possible relations directly in the network. The network contains 
categories like IP, Ibar, I, CP, Cbar, C, VP, Vbar, V, PP, PSpec, Pbar, P and so on. This 
corresponds to simple phrase structure grammars that fully specify the categories in the 
rules (see Section 2.2) rather than working with abstract schemata like the ones assumed 
in X theory (see Section 2.5). Furthermore Lin does not assume transformations but uses 
a GPSG-like feature passing approach to nonlocal dependencies (p. 116, see Section 5.4 
on the GPSG approach). 

Probably the most detailed implementation in the tradition of GB and Barriers is Sta- 
bler’s Prolog implementation (1992). Stabler’s achievement is certainly impressive, but 
his book confirms what has been claimed thus far: Stabler has to simply stipulate many 
things which are not explicitly mentioned in Barriers (e.g., using feature-value pairs 
when formalizing X theory, a practice that was borrowed from GPSG) and some as- 
sumptions cannot be properly formalized and are simply ignored (see Briscoe 1997 for 
details). 

GB analyses which fulfill certain requirements can be reformulated so that they no 
longer make use of transformations. These transformation-less approaches are also 
called representational, whereas the transformation-based approaches are referred to as 
derivational. For representational analyses, there are only surface structures augmented 
by traces but none of these structures is connected to an underlying structure by means 
of transformations (see e.g., Koster 1978: 1987: 235; Kolb & Thiersch 1991; Haider 1993: 
Section 1.4; Frey 1993: 14; Lohnstein 1993: 87-88, 177-178; Fordham & Crocker 1994: 
38; Veenstra 1998: 58). These analyses can be implemented in the same way as corre- 
sponding HPSG analyses (see Chapter 9) as computer-processable fragments and this 
has in fact been carried out for example for the analysis of verb position in German.?? 
However, such implemented analyses differ from GB analyses with regard to their ba- 
sic architecture and in small, but important details such as how one deals with the in- 
teraction of long-distance dependencies and coordination (Gazdar 1981b). For a critical 
discussion and classification of movement analyses in Transformational Grammar, see 
Borsley (2012). 

Following this somewhat critical overview, I want to add a comment in order to avoid 
being misunderstood: I do not demand that all linguistic work shall be completely for- 
malized. There is simply no space for this in a, say, thirty page essay. Furthermore, I do 
not believe that all linguists should carry out formal work and implement their analyses 
as computational models. However, there has to be somebody who works out the formal 


35This shows that ten Hacken's contrasting of HPSG with GB and LFG (ten Hacken 2007: Section 4.3) and the 
classification of these frameworks as belonging to different research paradigms is completely mistaken. In 
his classification, ten Hacken refers mainly to the model-theoretic approach that HPSG assumes. However, 
LFG also has a model-theoretic formalization (Kaplan 1995). Furthermore, there is also a model-theoretic 
variant of GB (Rogers 1998). For further discussion, see Chapter 14. 


123 


3 Transformational Grammar - Government & Binding 


details and these basic theoretical assumptions should be accepted and adopted for a 
sufficient amount of time by the research community in question. 


Comprehension questions 


1. Give some examples of functional and lexical categories. 


2. How can one represent lexical categories with binary features and what ad- 
vantages does this have? 


Exercises 
1. Draw syntactic trees for the following examples: 


(56) a. dass die Frau den Mann liebt 
that the.nom woman the.Acc man loves 


‘that the woman loves the man’ 


b. dass der Mann geliebt wird 
that the.nom man loved is 


‘that the man is loved’ 


c. Der Mann wird geliebt. 
the.nom man is loved 


“The man is loved? 


d. dass der Mann der Frau hilft 
that the.Nom man Oe par woman helps 


“that the man helps the woman’ 


e. DerMann hilft der Frau. 
the man.NoM helps the.DAT woman 


"Ihe man is helping the woman: 
For the passive sentences, use the analysis where the subject noun phrase is 


moved from the object position, that is, the analysis without an empty exple- 
tive as the subject. 


124 


3.6 Summary and classification 


Further reading 


For Sections 3.1-3.5, I used material from Peter Gallmann from 2003 (Gallmann 
2003). This has been modified, however, at various points. I am solely responsible 
for any mistakes or inadequacies. For current materials by Peter Gallmann, see 
http://www.syntax-theorie.de. 

In the book Syntaktische Analyseperspektiven, Lohnstein (2014) presents a vari- 
ant of GB which more or less corresponds to what is discussed in this chapter 
(CP/IP, movement-based analysis of the passive). The chapters in said book have 
been written by proponents of various theories and all analyze the same newspa- 
per article. This book is extremely interesting for all those who wish to compare 
the various theories out there. 

Haegeman (1994) is a comprehensive introduction to GB. Those who do read 
German may consider the textbooks by Fanselow & Felix (1987), von Stechow & 
Sternefeld (1988) and Grewendorf (1988) since they are also addressing the phe- 
nomena that are covered in this book. 

In many of his publications, Chomsky discusses alternative, transformation- 
less approaches as “notational variants”. This is not appropriate, as analyses with- 
out transformations can make different predictions to transformation-based ap- 
proaches (e.g., with respect to coordination and extraction. See Section 5.5 for 
a discussion of GPSG in this respect). In Gazdar (1981a), one can find a compar- 
ison of GB and GPSG as well as a discussion of the classification of GPSG as a 
notational variant of Transformational Grammar with contributions from Noam 
Chomsky, Gerald Gazdar and Henry Thompson. 

Borsley (1999b) and Kim & Sells (2008) have parallel textbooks for GB and 
HPSG in English. For the comparison of Transformational Grammar and LFG, 
see Bresnan & Kaplan (1982). Kuhn (2007) offers a comparison of modern deriva- 
tional analyses with constraint-based LFG and HPSG approaches. Borsley (2012) 
contrasts analyses of long-distance dependencies in HPSG with movement-based 
analyses as in GB/Minimalism. Borsley discusses four types of data which are 
problematic for movement-based approaches: extraction without fillers, extrac- 
tion with multiple gaps (see also the discussion of (57) on p.171 and of (55) on 
p.199 of this book), extractions where fillers and gaps do not match and extrac- 
tion without gaps. 
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Like the Government & Binding framework that was introduced in the previous chap- 
ter, the Minimalist framework was initiated by Noam Chomsky at the MIT in Boston. 
Chomsky (1993; 1995b) argued that the problem of language evolution should be taken 
seriously and that the question of how linguistic knowledge could become part of our 
genetic endowment should be answered. To that end he suggested refocusing the theo- 
retical developments towards models that have to make minimal assumptions regarding 
the machinery that is needed for linguistic analyses and hence towards models that as- 
sume less language specific innate knowledge. 

Like GB, Minimalism is wide-spread: theoreticians all over the world are working in 
this framework, so the following list of researchers and institutions is necessarily incom- 
plete. Linguistic Inquiry and Syntax are journals that almost exclusively publish Mini- 
malist work and the reader is referred to these journals to get an idea about who is active 
in this framework. The most prominent researchers in Germany are Artemis Alexiadou, 
Humboldt University Berlin; Günther Grewendorf (2002), Frankfurt am Main; Joseph 
Bayer, Konstanz; and Gereon Müller, Leipzig. 

While innovations like X theory and the analysis of clause structure in GB are highly 
influential and can be found in most of the other theories that are discussed in this book, 
this is less so for the technical work done in the Minimalist framework. It is nevertheless 
useful to familiarize with the technicalities since Minimalism is a framework in which 
a lot of work is done and understanding the basic machinery makes it possible to read 
empirically interesting work in that framework. 

While the GB literature of the 1980s and 1990s shared a lot of assumptions, there was 
an explosion of various approaches in the Minimalist framework that is difficult to keep 
track of. The presentation that follows is based on David Adger’s textbook (Adger 2003). 


4.1 General remarks on the representational format 


The theories that are developed in the framework of the Minimalist Program build on the 
work done in the GB framework. So a lot of things that were explained in the previous 
chapter can be taken over to this chapter. However, there have been some changes in 
fundamental assumptions. The general parametrized principles were dropped from the 
theory and instead the relevant distinctions live in features. Languages differ in the 
values that certain features may have and in addition to this, features may be strong or 
weak and feature strength is also a property that may vary from language to language. 
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Strong features make syntactic objects move to higher positions. The reader is familiar 
with this feature-driven movement already since it was a component of the movement- 
based analysis of the passive in Section 3.4. In the GB analysis of passive, the object had 
to move to the specifier position of IP in order to receive case. Such movements that are 
due to missing feature values are a key component in Minimalist proposals. 


4.1.1 Basic architecture 


Chomsky assumes that there are just two operations (rules) for combining linguistic 
objects: External and Internal Merge. External Merge simply combines two elements 
like the and book and results in a complex phrase. Internal Merge is used to account 
for movement of constituents. It applies to one linguistic object and takes some part of 
this linguistic object and adjoins it to the left of the respective object. The application 
of External Merge and Internal Merge can apply in any order. For instance, two objects 
can be combined with External Merge and then one of the combined items is moved to 
the left by applying Internal Merge. The resulting object can be externally merged with 
another object and so on. As an example consider the Determiner Phrase (DP) in (1): 


(1) the man who we know 


To derive this DP the verb know is externally merged with its object who. After several 
intermediate merges that will be discussed below, know who will be merged with we and 
finally the who is moved to the left by Internal Merge, resulting in who we know. This 
relative clause can be externally merged with man and so on. 

So, Minimalist theories differ from GB in not assuming a Deep Structure that is gener- 
ated by some X grammar and a Surface Structure that is derived from the Deep Structure 
by Move-a. Instead, it is assumed that there is a phase in which External and Internal 
Merge (combination and movement) apply in any order to derive a certain structure that 
is then said to be spelled out. It is said that the structure is sent to the interfaces: the 
articulatory-perceptual system (AP) on the one hand and the conceptual-intentional sys- 
tem (CI) on the other side. AP corresponds to the level of Phonological Form (PF) and 
CI to the level of Logical Form (LF) in GB. The new architecture is depicted in Figure 4.1 
on the facing page (left figure). Syntax is assumed to operate on so-called numerations, 
selections of lexical items that are relevant for a derivation.” Overt syntax stands for 
syntactic operations that usually have a visible effect. After overt syntax the syntactic 
object is sent off to the interfaces and some transformations may take place after this 
Spell-Out point. Since such transformations do not affect pronunciation, this part of 


! Most researchers working in Minimalism follow Abney (1987) in assuming that the determiner rather than 
the noun is the head of nominal structures. Hence sequence like (1) are determiner phrases rather than 
noun phrases. 

“It is unclear to me how numerations are determined. Since empty elements play a crucial role in the 
analysis of sentences in Minimalism and since it is not known which empty elements are needed in an 
actual analysis until the analysis is carried out, there will be infinitely many numerations that potentially 
could be used in the analysis of a given string. Which numeration is chosen and how numerations could 
be integrated in psycholinguistically plausible models of human sentence comprehension is unclear to 
me. Numerations will be ignored in what follows and it will be assumed that lexical items come from the 
lexicon directly. See Hornstein et al. (2005: Section 2.3.2.6) for more on numerations. 
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lexicon —— numeration 


Transfer 
lexicon — enumeration Transfer 
PHON SEM 
Transfer 
overt syntax 
PHON SEM " : 
Spell-Out ransfer 
PHON SEM 
covert syntax 
PHON e SEM 
Y 
LF/CI PF/AP Transfer 
(meaning) (sound) 
PHON SEM 


Figure 4.1: Architecture assumed in Minimalist theories before the Phase model (left) and 
in the Phase model (right) according to Richards (2015: 812, 830) 


syntax is called covert syntax. Like in GB’s LF, the covert syntax can be used to derive 
certain scope readings. 

This architecture was later modified to allow Spell-Out at several points in the deriva- 
tion (right figure). It is now assumed that there are phases in a derivation and that a 
completed phase is spelled out once it is used in a combination with a head (Chomsky 
2008). For instance, a subordinated sentence like that Peter comes in (2) is one phase and 
is sent to the interfaces before the whole sentence is completed.’ 


(2) He believes that Peter comes. 


There are different proposals as to what categories form complete phases. Since the 
concept of phases is not important for the following introduction, I will ignore this con- 
cept in the following. See Section 15.1 on the psycholinguistic plausibility of phases in 
particular and the Minimalist architecture in general. 


4.1.2 Valence, feature checking, and agreement 


The basic mechanism in Minimalist theories is feature checking. For instance, the noun 
letters may have a P feature, which means that it has to combine with a PP in order to 
form a complete phrase. 


3 Andreas Pankau (p.c. 2015) pointed out to me that there is a fundamental problem with such a conception 
of phases, since if it is the case that only elements that are in a relation to a head are send off to the interface 
then the topmost phrase in a derivation would never be sent to the interfaces, since it does not depend on 
any head. 
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(3) letters to him 


It is assumed that there are interpretable and uninterpretable features. An example of 
an interpretable feature is the number feature of nouns. The singular/plural distinction 
is semantically relevant. The category features for part of speech information are purely 
syntactic and hence cannot be interpreted semantically. Minimalism assumes that all 
uninterpretable features have to be used up during the derivation of a complex linguistic 
object. This process of eating up the features is called checking. As an example, let us 
consider the noun letters again. The analysis of (3) is depicted in Figure 4.2. The fact 


N 


DI RS 


letters [N, pl, #2] P 


P i 


to [P, uD] X him [D] 
Figure 4.2: Valence representation via uninterpretable features 


that the P feature of letters is uninterpretable is represented by the little u in front of 
the P. The uninterpretable P feature of letters can be checked against the P feature of to 
him. him is assumed to be of category D since it can appear in the same places as full 
nominal phrases, which are assumed to be DPs here. All checked features are said to 
delete automatically. The deletion is marked by striking the features out in the figures. 
Strings like (4) are ruled out as complete derivations since the D feature of P is not 
checked. This situation is shown in Figure 4.3. 


(4) * letters to 


N 


Du DER 


letters [N, pl, #2] to [P, uD] 
Figure 4.3: Illegitimate syntactic object due to an uninterpretable feature 


If this structure would be used in a larger structure that is spelled out, the derivation 
would crash since the conceptual system could not make sense of the D feature that is 
still present at the P node. 

Selectional features are atomic, that is, the preposition cannot select an DP[acc] as in 
GB and the other theories in this book unless DP[acc] is assumed to be atomic. There- 
fore, an additional mechanism is assumed that can check other features in addition to 
selectional features. This mechanism is called Agree. 
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(5) a. “letters to he 
b. letters to him 


The analysis of (5b) is shown in Figure 4.4. There is an interesting difference between the 


N 


e Ee 


letters [N, pl, #2] P 


E d N 


to [P, 4D, aee] him [D, aee] 
Figure 4.4: Feature checking via Agree 


checking of selectional features and the checking of features via Agree. The features that 
are checked via Agree do not have to be at the top node of the object that is combined 
with a head. This will play a role later in the analysis of the passive and local reordering. 


4.1.3 Phrase structure and X theory 


The projections of X structures were given in Figure 2.9 on page 76. According to early 
versions of the X theory, there could be arbitrarily many complements that were com- 
bined with X? to form an X. Arbitrarily many adjuncts could attach to X and then at 
most one specifier could be combined with the X yielding an XP. Minimalist theories 
assume binary branching and hence there is at most one complement, which is the first- 
merged item. Furthermore, it is not assumed that there is a unique specifier position. 
Chomsky rather assumes that all items that are not complements are specifiers. That 
is, he distinguishes between first-merged (complements) and later-merged items (spec- 
ifiers). Figure 4.5 shows an example with two specifiers. It is also possible to have just 
a complement and no specifier or to have one or three specifiers. What structures are 


XP 
specifier X 
specifier X 


Fi 


complement X 


Figure 4.5: Complements and specifiers in Minimalist theories 
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ultimately licensed depends on the features of the items that are involved in the Merge 
operations. Whether a phrasal projection counts as an X or an XP depends on whether 
the phrase is used as a complement or specifier of another head or whether it is used 
as head in further Merge operations. If a phrase is used as specifier or complement its 
status is fixed to be a phrase (XP), otherwise the projectional status of resulting phrases 
is left underspecified. Lexical head daughters in Merge operations have the category X 
and complex head daughters in Merge operations have the category X. This solves the 
problem that standard X theoretic approaches had with pronouns and proper names: a 
lot of unary branching structure had to be assumed (See left picture in Figure 2.9). This 
is not necessary any longer in current Minimalist theories.* 


4.1.4 Little v 


In Section 3.4, I used X structures in which a ditransitive verb was combined with its 
accusative object to form a V, which was then combined with the dative object to form 
a further V. Such binary branching structures and also flat structures in which both 
objects are combined with the verb to form a V are rejected by many practitioners of 
GB and Minimalism since the branching does not correspond to branchings that would 
be desired for phenomena like the binding of reflexives and negative polarity items. A 
binding in which Benjamin binds himself in (6a) is impossible: 


(6) a. *Emily showed himself Benjamin in the mirror. 


b. Peter showed himself Benjamin in the mirror. 


What is required for the analysis of Binding and NPI phenomena in theories that analyze 
these phenomena in terms of tree configurations is that the reflexive pronoun is “higher” 
in the tree than the proper name Benjamin. More precisely, the reflexive pronoun himself 
has to c-command Benjamin. c-command is defined as follows (Adger 2003: 117)? 


(7) A node A c-commands B if, and only if A's sister either: 
a. is B, or 
b. contains B 


In the trees to the left and in the middle of Figure 4.6 on the next page the c-command 
relations are not as desired: in the left-most tree both DPs c-command each other and in 
the middle one Benjamin c-commands himself rather than the other way round. Hence 
it is assumed that the structures at the left and in the middle are inappropriate and that 
there is some additional structure involving the category v, which is called little v (Adger 
2003: Section 4.4). The sister of himself is V and V contains Benjamin, hence himself c- 
commands Benjamin. Since the sister of Benjamin is V and V neither is nor contains 
himself, Benjamin does not c-command himself. 


^For problems with this approach see Brosziewski (2003: Section 2.1). 
*c-command also plays a prominent role in GB. In fact, one part of Government & Binding is the Binding 
Theory, which was not discussed in the previous chapter since binding phenomena do not play a role in 


this book. 
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y V y 
show himself ^ Benjamin V Benjamin show VP 
show himself himself V 


d 


V Benjamin 


Figure 4.6: Three possible analyses of ditransitives 


The analysis of ditransitives involving an additional verbal head goes back to Larson 
(1988). Hale & Keyser (1993: 70) assume that this verbal head contributes a causative 
semantics. The structure in Figure 4.7 is derived by assuming that the verb show starts 
out in the V position and then moves to the v position. show is assumed to mean see and 
in the position of little v it picks up the causative meaning, which results in a cause-see’ 
meaning (Adger 2003: 133). 


vP 


ER 
E ii 


v+ show VP 


gs. 


himself V 


BE, 


(show) [V] Benjamin 


I 
S 
© 
x 
«I 


Figure 4.7: Analysis of ditransitives involving movement to little v 


While the verb shell analysis with an empty verbal head was originally invented by 
Larson (1988) for the analysis of ditransitive verbs, it is now also used for the analysis of 
strictly transitive and even intransitive verbs. 

Adger (2003: Section 4.5) argues that semantic roles are assigned uniformly in certain 
tree configurations: 


(8 a. DP daughter of vP interpreted as agent 
b. DP daughter of VP — interpreted as theme 
c. PP daughter of v — interpreted as goal 
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Adger assumes that such uniformly assigned semantic roles help in the process of lan- 
guage acquisition and from this, it follows that little v should also play a role in the analy- 
sis of examples with strictly transitive and intransitive verbs. The Figures 4.8 and 4.9 
show the analysis of sentences containing the verbs burn and laugh, respectively. 


burn [V, 4D] Theme 


Figure 4.8: Analysis of strictly transitives involving little v 


) 


v laugh [V] 
Figure 4.9: Analysis of intransitives involving little v 


Adger (2003: 164) assumes that intransitive and transitive verbs move from V to little 
v as well. This will be reflected in the following figures. 


4.1.5 CP, TP, vP, VP 


Section 3.1.5 dealt with the CP/IP system in GB. In the course ofthe development of Min- 
imalism, the Inflectional Phrase was split into several functional projections (Chomsky 
1989) of which only the Tense Phrase is assumed in current Minimalist analyses. So, the 
TP of Minimalism corresponds to IP in the GB analysis. Apart from this change, the core 
ideas of the CP/IP analysis have been transferred to the Minimalist analysis of English. 
This subsection will first discuss special features that are assumed to trigger movement 
(Subsection 4.1.5.1) and then case assignment (Subsection 4.1.5.2). 


51f all intransitive verbs of this type are supposed to have agents as subjects, a very broad conception of 
agent has to be assumed that also subsumes the subject of verbs like sleep. Usually sleeping is not an 
activity that is performed intentionally. 
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4.1.5.1 Features as triggers for movement: The EPP feature on T 


In GB approaches, the modals and auxiliaries were analyzed as members of the category 
I and the subjects as specifiers of IP. In the previous section, I showed how subjects 
are analyzed as specifiers of vP. Now, if one assumes that a modal verb combines with 
such a vP, the subject follows the modal, which does not correspond to the order that 
is observable in English. This problem is solved by assuming a strong uninterpretable D 
feature at T. Since the feature is strong, a suitable D has to move to the specifier of T and 
check the D locally. Figure 4.10 shows the TP that plays a role in the analysis of (9): 


(9) Anna will read the book. 


read v  (read)[V, uD] DP 


pe. 


the book 


Figure 4.10: Analysis of Anna will read the book. involving a modal and movement of the 
subject from v to T 


The Determiner Phrase (DP) the book is the object of read and checks the D feature of 
read. little v selects for the subject Anna. Since T has a strong D feature (marked by an 
asterisk “*’), Anna must not remain inside ofthe vP but moves on to the specifier position 
of TP. 

Full sentences are CPs. For the analysis of (9), an empty C head is assumed that is 
combined with the TP. The empty C contributes a clause type feature Decl. The full 
analysis of (9) is shown in Figure 4.11. 

The analysis of the question in (10) involves an unvalued clause-type feature on T for 
the sentence type question. 


(10) What will Anna read? 
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CP 


FR 


C[Decl] TP 


por Ve 


Anna [D]  T[uD*] 


yer E 


will T[pres] vP 


Pu 


(Anna) v[uB] 


pe uc 


v VP 


Ce 


read v  (read)[V, uD] DP 


dE EE 


the book 


Figure 4.11: Analysis of Anna will read the book. as CP with an empty C with the clause- 
type feature Decl 


The empty complementizer C has a Q feature that can value the clause-type feature on 
T. Since clause-type features on T that have the value Q are stipulated to be strong, the 
T element has to move to C to check the feature locally. In addition, the wh element is 
moved. This movement is enforced by a strong wh feature on C. The analysis of (10) is 
given in Figure 4.12 on the next page. 


4.1.5.2 Case assignment 


In the GB analysis that was presented in Chapter 3, nominative was assigned by (finite) 
I and the other cases by the verb (see Section 3.4.2). The assignment of nominative is 
taken over to Minimalist analyses, so it is assumed that nominative is assigned by (finite) 
T. However, in the Minimalist theory under consideration, there is not a single verb 
projection, but there are two verbal projections: vP and VP. Now, one could assume that 
V assigns accusative to its complement or that v assigns accusative to the complement 
of the verb it dominates. Adger (2003: Section 6.3.2, Section 6.4) assumes the latter 
approach, since it is compatible with the analysis of so-called unaccusative verbs and 
the passive. Figure 4.13 on the facing page shows the TP for (11): 


(11) Anna reads the book. 


The two DPs Anna and the book start out with unvalued uninterpretable case features: 
[ucase:]. The features get valued by T and v. It is assumed that only one feature is 
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CP 
ze Eer 
what ID. wh]  C[za¥h*] 
ero Tan. 
C TP 


read {v (read) [V, uD] (what) 


Figure 4.12: Analysis of What will Anna read? with an empty C with a strong wh feature 


T[pres] vP 


e SE 


(Anna) | v[wB] 


v VP 


D m T dm 


read | v[aece]  (read)[V, 4D] DPlace] 


Pais 


the book 


Figure 4.13: Case assignment by T and v in the TP for of Anna reads the book. 
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checked by Merge, so this would be the D feature on T, leaving the case feature for the 
other available checking mechanism: Agree. Agree can be used to check features in sister 
nodes, but also features further away in the tree. The places that are possible candidates 
for Agree relations have to stand in a certain relation to each other. The first node has to 
c-command the node it Agrees with. c-command roughly means: one node up and then 
arbitrarily many nodes down. So v c-commands VP, V, the DP the book, and all the nodes 
within this DP. Since Agree can value features of c-commanded nodes, the accusative on 
v can value the case feature of the DP the book. 

The non-locality that is build into Agree raises a problem: why is it that (12) is un- 
grammatical? 


(12) * Him likes she. 


The accusative of v could be checked with its subject and the nominative of T with the 
object of likes. Both DPs stand in the necessary c-command relations to T and v. This 
problem is solved by requiring that all Agree relations have to involve the closest possible 
element. Adger (2003: 218) formulates this constraint as follows: 


(13) Locality of matching: Agree holds between a feature F on X and a matching feature 
F on Y if and only if there is no intervening Z[F]. 


Intervention is defined as in (14): 


(14) Intervention: In a structure [X ... Z ... Y], Z intervenes between X and Y iff X 
c-commands Z and Z c-commands Y. 


So, since T may Agree with Anna it must not Agree with the book. Hence nominative 
assignment to she in (12) is impossible and (12) is correctly ruled out. 


4.1.6 Adjuncts 


Adger (2003: Section 4.2.3) assumes that adjuncts attach to XP and form a new XP. 
He calls this operation Adjoin. Since this operation does not consume any features it 
is different from External Merge and hence a new operation would be introduced into 
the theory, contradicting Chomsky’s claim that human languages use only Merge as a 
structure building operation. There are proposals to treat adjuncts as elements in special 
adverbial phrases with empty heads (see Section 4.6.1) that are also assumed to be part 
of a hierarchy of functional projections. Personally, I prefer Adger’s solution that corre- 
sponds to what is done in many other frameworks: there is a special rule or operation 
for the combination of adjuncts and heads (see for instance Section 9.1.7 on the HPSG 
schema for head adjunct combinations). 


4.2 Verb position 


The analysis of verb first sentences in German is straightforward, given the machinery 
that was introduced in the previous section. The basic idea is the same as in GB: the 
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finite verb moves from V to v to T and then to C. The movement to T is forced by a 
strong tense feature on T and the movement of the T complex to C is enforced by a 
clause-type feature on T that is valued as a strong Interrogative feature (Int) or a strong 
Declarative feature (Decl) by C. The analysis of the interrogative clause in (15) is shown 
in Figure 4.14. 


(15) Kennt jeder diesen Mann? 
knows everybody this man 


‘Does everybody know this man?’ 
CP 


C TP 


"da E dn 


Tine]  C[Int] jeder TiD] 


"uio M d 


kennt [Pres*] T[Pres] vP (kennt T) 
a 
(jeder) v 

m ie ccm 

VP v 


DP (kennt) (kennt) v 


qos mem, 


diesen Mann 


Figure 4.14: Analysis of Kennt jeder diesen Mann? “Does everybody know this man?’ 
following the analysis of Adger (2003) 


4.3 Long-distance dependencies 


Having explained the placement of the verb in initial position, the analysis of V2 sen- 
tences does not come with a surprise: Adger (2003: 331) assumes a feature that triggers 
the movement of a constituent to a specifier position of C. Adger calls this feature top, 
but this is a misnomer since the initial position in German declarative sentences is not 
restricted to topics (maybe he would assume an alternative C head with a foc feature). 
Figure 4.15 on the following page shows the analysis of (16): 
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(16) Diesen Mann kennt jeder. 
this man knows everybody 


‘Everbody knows this man! 


CP 


Ius ee 


diesen Mann [top] ` Caen?) 


IR 


C TP 


Pr HP N 


T[Dee*]  C[Decl] jeder  T[wD*] 


uu Dës BET 


kennt [Pres*] T[Pres] vP (kennt T) 
A Des 
(jeder) v 
se en, 
VP v 


(diesen Mann )[D] (kennt) (kennt) v 


Figure 4.15: Analysis of Diesen Mann kennt jeder. “This man, everybody knows: following 
the analysis of Adger (2003: 331) 


As in the verb-initial clause in Figure 4.14 a feature on C triggers verb movement. This 
time it is a Decl feature since we are dealing with a declarative clause. The top feature 
triggers movement of diesen Mann ‘this man’ to the specifier position of C. 


4.4 Passive 


Adger (2003) suggests an analysis for the passive in English, which I adapted here to 
German. Like in the GB analysis that was discussed in Section 3.4 it is assumed that 
the verb does not assign accusative to the object of schlagen ‘to beat’. In Minimalist 
terms, this means that little v does not have an acc feature that has to be checked. This 
special version of little vis assumed to play a role in the analysis of sentences of so-called 
unaccusative verbs (Perlmutter 1978). Unaccusative verbs are a subclass of intransitive 
verbs that have many interesting properties. For instance, they can be used as adjectival 
participles although this is usually not possible with intransitive verbs: 
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(17) a. *der getanzte Mann 
the danced man 


b. der gestorbene Mann 
the died man 


‘the dead man’ 


The explanation of this difference is that adjectival participles predicate over what is the 
object in active sentences: 


(18) a. dass der Mann das Buch gelesen hat 
that the man the book read has 


‘that the man read the book’ 


b. das gelesene Buch 
the read book 


Now the assumption is that the argument of gestorben ‘died’ behaves like an object, while 
the argument of getanzt ‘danced’ behaves like a subject. If adjectival passives predicate 
over the object it is explained why (17b) is possible, while (17a) is not. 

Adger (2003: 140) assumes the structure in Figure 4.16 for vPs with unaccusative verbs. 
It is assumed that this unaccusative variant of little v plays a role in the analysis of the 


vP 


y^ 


v VP 


2 e 


fall[V, uN] Theme 


Figure 4.16: Structure of vP with unaccusative verbs like fall, collapse, wilt according to 
Adger (2003: 140) 


passive. Unaccusative verbs are similar to passivized verbs in that they do have a subject 
that somehow also has object properties. The special version of little v is selected by 
the Passive head werden ‘be’, which forms a Passive Phrase (abbreviated as PassP). See 
Figure 4.17 for the analysis of the example in (19): 


(19) dass er geschlagen wurde 

that he beaten was 

‘that he was beaten’ 
The Pass head requires the Infl feature of little v to have the value Pass, which results 
in participle morphology at spellout. Hence the form that is used is geschlagen ‘beaten’. 


The auxiliary moves to T to check the strong Infl feature at T and since the Infl feature is 
past, the past form of werden ‘be’, namely wurde ‘was’, is used at spellout. T has a nom 
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TP 
Se 
PassP T[past,aem] 
uM n NU AES 
vP (werden) werden [Pass,uInfpast'] ^T[past] 
n cp c 
VP v 


pronoun [nem] (schlagen) schlagen v[wIafl:Pass] 


Figure 4.17: Minimalist analysis of the passive without movement but with nonlocal case 
assignment via Agree 


feature that has to be checked. Interestingly, the Minimalist approach does not require 
the object of schlagen to move to the specifier position of T in order to assign case, 
since case assignment is done via Agree. Hence in principle, the pronominal argument 
of schlagen could stay in its object position and nevertheless get nominative from T. 
This would solve the problem of the GB analysis that was pointed out by Lenerz (1977: 
Section 4.4.3). See page 112 for Lenerz’ examples and discussion ofthe problem. However, 
Adger (2003: 332) assumes that German has a strong EPP feature on T. If this assumption 
is upheld, all problems of the GB account will carry over to the Minimalist analysis: all 
objects have to move to T even when there is no reordering taking place. Furthermore, 
impersonal passives of the kind in (20) would be problematic, since there is no noun 
phrase that could be moved to T in order to check the EPP feature: 


(20) weil getanzt wurde 
because danced was 


‘because there was dancing there’ 


4.5 Local reordering 


Adger (2003) does not treat local reordering. But there are several other suggestions in 
the literature. Since all reorderings in Minimalist theories are feature-driven, there must 
be an item that has a feature that triggers reorderings like those in (21b): 


(21) a. [weil] jeder diesen Mann kennt 
because everyone this man knows 
‘because everyone knows this man’ 
b. [weil] diesen Mann jeder kennt 


because this man everyone knows 


There have been various suggestions involving functional projections like Topic Phrase 
(Laenzlinger 2004: 222) or AgrS and AgrO (Meinunger 2000: Chapter 4) that offer places 
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to move to. G. Müller (2014a: Section 3.5) offers a leaner solution, though. In his ap- 
proach, the object simply moves to a second specifier position of little v. The analysis is 
depicted in Figure 4.18. 


dass vP kennt [T] 


diesen Mann v 


zs 
jeder v 
nm x s c 


VP v 


Rr c OM d 


(diesen Mann) [D] (kennt) (kennt) v 


Figure 4.18: Analysis of dass diesen Mann jeder kennt 'that everybody knows this man' 
as movement of the object to a specifier position of v 


An option that was suggested by Laenzlinger (2004: 229-230) is to assume several 
Object Phrases for objects that may appear in any order. The objects move to the specifier 
positions of these projections and since the order of the Object Phrases is not restricted, 
both orders in (22) can be analyzed: 


(22) a. dass Hans diesen Brief meinem Onkel gibt 
that Hansthis letter my uncle gives 
‘that Hans gives this letter to my uncle’ 


b. dass Hans meinem Onkel diesen Brief gibt 
that Hans my uncle this letter gives 


‘that Hans gives to my uncle this letter’ 


4.6 New developments and theoretical variants 


At the start of the 90s, Chomsky suggested a major rethink of the basic theoretical as- 
sumptions of GB and only keeping those parts of the theory which are absolutely nec- 


7G. Müller assumes optional features on v and V that trigger local reorderings (p. 48). These are not given 
in the figure. 
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essary. In the Minimalist Program, Chomsky gives the central motivations for the far- 
reaching revisions of GB theory (Chomsky 1993; 1995b). Until the beginning of the 90s, it 
was assumed that Case Theory, the Theta-Criterion, X theory, Subjacency, Binding The- 
ory, Control Theory etc. all belonged to the innate faculty for language (Richards 2015: 
804). This, of courses, begs the question of how this very specific linguistic knowledge 
made its way into our genome. The Minimalist Program follows up on this point and at- 
tempts to explain properties of language through more general cognitive principles and 
to reduce the amount of innate language-specific knowledge postulated. The distinction 
between Deep Structure and Surface Structure, for example, was abandoned. Move still 
exists as an operation, but can be used directly to build sub-structures rather than after 
a complete D-structure has been created. Languages differ with regard to whether this 
movement is visible or not. 

Although Chomsky’s Minimalist Program should be viewed as a successor to GB, ad- 
vocates of Minimalism often emphasize the fact that Minimalism is not a theory as such, 
but rather a research program (Chomsky 2007: 4; 2013: 6). The actual analyses sug- 
gested by Chomsky (1995b) when introducing the research program have been reviewed 
by theoreticians and have sometimes come in for serious criticism (Kolb 1997; Johnson 
& Lappin 1997; 1999; Lappin, Levine & Johnson 2000a,b; 2001; Seuren 2004; Pinker & 
Jackendoff 2005), however, one should say that some criticisms overshoot the mark. 

There are various strains of Minimalism. In the following sections, I will discuss some 
of the central ideas and explain which aspects are regarded problematic. 


4.6.1 Move, Merge, feature-driven movement and functional 
projections 


Johnson, Lappin and Kolb have criticized the computational aspects of Chomsky’s sys- 
tem. Chomsky suggested incorporating principles of economy into the theory. In cer- 
tain cases, the grammatical system can create an arbitrary number of structures, but 
only the most economical, that is, the one which requires the least effort to produce, 
will be accepted as grammatical (transderivational economy). This assumption does not 
necessarily have to be taken too seriously and, in reality, does not play a role in many 
works in the Minimalist framework (although see Richards (2015) for recent approaches 
with derivations which are compared in terms of economy). Nevertheless, there are 
other aspects of Chomsky’s theory which can be found in many recent works. For exam- 
ple, Chomsky has proposed reducing the number of basic, structure building operations 
which license structures to two: Move and Merge (that is, Internal and External Merge). 
Move corresponds to the operation Move-a, which was already discussed in Chapter 3, 
and Merge is the combination of (two) linguistic objects. 

It is generally assumed that exactly two objects can be combined (Chomsky 1995b: 
226). For Move, it is assumed that there must be a reason for a given movement op- 
eration. The reason for movement is assumed to be that an element can check some 
feature in the position it is moved to. This idea was already presented in the analysis of 
the passive in Section 3.4: the accusative object does not bear case in passive sentences 
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and therefore has to be moved to a position where it can receive case. This kind of ap- 
proach is also used in newer analyses for a range of other phenomena. For example, it 
is assumed that there are phrases whose heads have the categories focus and topic. The 
corresponding functional heads are always empty in languages like German and English. 
Nevertheless, the assumption of these heads is motivated by the fact that other languages 
possess markers which signal the topic or focus of a sentence morphologically. This ar- 
gumentation is only possible if one also assumes that the inventory of categories is the 
same for all languages. Then, the existence of a category in one language would suggest 
the existence of the same category in all other languages. This assumption of a shared 
universal component (Universal Grammar, UG) with detailed language-specific knowl- 
edge is, however, controversial and is shared by few linguists outside of the Chomskyan 
tradition. Even for those working in Chomskyan linguistics, there have been questions 
raised about whether it is permissible to argue in this way since if it is only the ability to 
create recursive structures that is responsible for the human-specific ability to use lan- 
guage (faculty of language in the narrow sense) - as Hauser, Chomsky & Fitch (2002) 
assume -, then the individual syntactic categories are not part of UG and data from other 
languages cannot be used to motivate the assumption of invisible categories in another 
language. 


4.6.1.1 Functional projections and modularization of linguistic knowledge 


The assumption that movement must be licensed by feature checking has led to an infla- 
tion of the number of (silent) functional heads.? Rizzi (1997: 297) suggests the structure 
in Figure 4.19 on the next page (see also Grewendorf 2002: 85, 240; 2009). 

The functional categories Force, Top, Foc and Fin correspond to clause type, topic, 
focus and finiteness. It is assumed that movement always targets a specifier position. 
Topics and focused elements are always moved to the specifier position of the corre- 
sponding phrase. Topics can precede or follow focused elements, which is why there are 
two topic projections: one above and one below FocP. Topic phrases are recursive, that 
is, an arbitrary number of TopPs can appear at the positions of TopP in the figure. Follow- 
ing Grewendorf (2002: 70), topic and focus phrases are only realized if they are required 
for particular information structural reasons, such as movement.’ Chomsky (1995b: 147) 
follows Pollock (1989) in assuming that all languages have functional projections for 
subject and object agreement as well as negation (AgrS, AgrO, Neg). Sternefeld (1995: 


*The assumption of such heads is not necessary since features can be "bundled" and then they can be checked 
together. For an approach in this vein, which is in essence similar to what theories such as HPSG assume, 
see Sternefeld (2006: Section IL.3.3.4, Section IL4.2). 

In so-called cartographic approaches, it is assumed that every morphosyntactic feature corresponds 
to an independent syntactic head (Cinque & Rizzi 2010: 54, 61). For an explicitly formalized proposal in 
which exactly one feature is consumed during a combination operation see Stabler (2001: 335). Stabler's 
Minimalist Grammars are discussed in more detail in Section 4.6.4. 

* There are differing opinions as to whether functional projections are optional or not. Some authors assume 
that the complete hierarchy of functional projections is always present but functional heads can remain 
empty (e.g., Cinque 1999: 106 and Cinque & Rizzi 2010: 55). 

sce Chomsky (1995b: Section 4.10.1), however. 
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Figure 4.19: Syntactic structure of sentences following Rizzi (1997: 297) 


78), von Stechow (1996: 103) and Meinunger (2000: 100-101, 124) differentiate between 
two agreement positions for direct and indirect objects (AgrO, AgrIO). As well as AgrS, 
AgrO and Neg, Beghelli & Stowell (1997) assume the functional heads Share and Dist in 
order to explain scope phenomena in English as feature-driven movements at LF. For a 
treatment of scope phenomena without empty elements or movement, see Section 19.3. 
Blaszczak & Gartner (2005: 13) assume the categories —PolP, +PolP and %PolP for their 
discussion of polarity. 

Webelhuth (1995: 76) gives an overview of the functional projections that had been 
proposed up to 1995 and offers references for AgrA, AgrN, AgrV, Aux, Clitic Voices, 
Gender, Honorific, jj Number, Person, Predicate, Tense, Z. 

In addition to AdvP, NegP, AgrP, FinP, TopP and ForceP, Wiklund, Hrafnbjargarson, 
Bentzen & Hröarsdöttir (2007) postulate an OuterTopP. Poletto (2000: 31) suggests both 
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a HearerP and a SpeakerP for the position of clitics in Italian. Bosse & Bruening (2011: 
75) assume a BenefactiveP 
Cinque (1999: 106) adopts the 32 functional heads in Table 4.1 in his work. He assumes 


Table 4.1: Functional heads following Cinque (1999: 106) 


1. Mo Odgpeech Act 2. Moodgyatuative 3. Moodgyidential 4. Mo Odgpistemic 

5. T(Past) 6. T(Future) 7. Moody i; 8. Modnecessity 

9. Modpossibility 10. Mod volitional 11. Modopiigation 12. Mod apitity/permission 
13. ASPHsbitual 14. ASP Repetitive(!) 15. ASP Frequentative(1) 16. Asp Celerative(I) 

17. T(Anterior) 18. ASP Terminative 19. Asp Continuative 20. Asp Perfect(?) 
21. ASPRetrospective 22. ASPproximative 23. ASPpurative 24. Asp Generic/progressive 
25. ASP prospective 26. ASpseCompletive(I) 27. ASPpicompletive 28. ASpvoice 
29. Asp Celerative(II) 30. ASpssCompletive(TI) 31. ASP Repetitive(II) 32. Asp Frequentative(II) 


that all sentences contain a structure with all these functional heads. The specifier po- 
sitions of these heads can be occupied by adverbs or remain empty. Cinque claims that 
these functional heads and the corresponding structures form part of Universal Gram- 
mar, that is, knowledge of these structures is innate (page 107). Laenzlinger (2004) fol- 
lows Cinque in proposing this sequence of functional heads for German. He also follows 
Kayne (1994), who assumes that all syntactic structures have the order specifier head 
complement cross-linguistically, even if the surface order of the constituents seems to 
contradict this. 

The constituent orders that are visible in the end are derived by leftward-movement." 
Figure 4.20 on page 149 shows the analysis of a verb-final clause where the functional 


"Table 4.1 shows only the functional heads in the clausal domain. Cinque (1994: 96, 99) also accounts for the 
order of adjectives with a cascade of projections: Quality, Size, Shape, Color, Nationality. These categories 
and their ordering are also assumed to belong to UG (p. 100). 

Cinque (1994: 96) claims that a maximum of seven attributive adjectives are possible and explains this 
with the fact that there are a limited number of functional projections in the nominal domain. As was 
shown on page 65, with a fitting context it is possible to use several adjectives of the same kind, which is 
why some of Cinque’s functional projections would have to be subject to iteration. 

This also counts for extraposition, that is, the movement of constituents into the postfield in German. 
Whereas this would normally be analyzed as rightward-movement, Kayne (1994: Chapter 9) analyzes it as 
movement of everything else to the left. Kayne assumes that (i.b) is derived from (i.a) by moving part of 
the DP: 


(i) a. just walked into the room [pp someone who we don't know]. 


b. Someone; just walked into the room [pp _; who we don't know]. 


(i.a) must have to be some kind of derived intermediate representation, otherwise English would not be 
SV(O) underlyingly but rather V(OJS. (i.a) is therefore derived from (ii) by fronting the VP just walked into 
the room. 


(ii) Someone who we don't know just walked into the room 


Such analyses have the downside that they cannot be easily combined with performance models (see Chap- 
ter 15). 
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adverbial heads have been omitted.? Subjects and objects are generated as arguments 
inside of vP and VP, respectively. The subject is moved to the specifier of the subject 
phrase and the object is moved to the specifier of the object phrase. The verbal pro- 
jection DND) is moved in front of the auxiliary into the specifier position of the phrase 
containing the auxiliary. The only function of SubjP and ObjP is to provide a landing site 
for the respective movements. For a sentence in which the object precedes the subject, 
Laenzlinger assumes that the object moves to the specifier of a topic phrase. Figure 4.20 
contains only a ModP and an AspP, although Laenzlinger assumes that all the heads 
proposed by Cinque are present in the structure of all German clauses. For ditransitive 
verbs, Laenzlinger assumes multiple object phrases (page 230). A similar analysis with 
movement of object and subject from verb-initial VPs to Agr positions was suggested by 
Zwart (1994) for Dutch. 

For general criticism of Kayne's model, see Haider (2000). Haider shows that a Kayne- 
like theory makes incorrect predictions for German (for instance regarding the position 
of selected adverbials and secondary predicates and regarding verbal complex formation) 
and therefore fails to live up to its billing as a theory which can explain all languages. 
Haider (1997a: Section 4) has shown that the assumption of an empty Neg head, as as- 
sumed by Pollock (1989), Haegeman (1995) and others, leads to problems. See Bobaljik 
(1999) for problems with the argumentation for Cinque's cascade of adverb-projections. 

Furthermore, it has to be pointed out that SubjP and ObjP, TraP (Transitive Phrase) and 
IntraP (Intransitive Phrase) (Karimi-Doostan 2005: 1745) and TopP (topic phrase), DistP 
(quantifier phrase), AspP (aspect phrase) (Kiss 2003: 22; Karimi 2005: 35), PathP and Pla- 
ceP (Svenonius 2004: 246) encode information about grammatical function, valence, in- 
formation structure and semantics in the category symbols." In a sense, this is a misuse 
of category symbols, but such a misuse of information structural and semantic categories 
is necessary since syntax, semantics, and information structure are tightly connected 
and since it is assumed that the semantics interprets the syntax, that is, it is assumed 
that semantics comes after syntax (see Figure 3.2 and Figure 4.1). By using semantically 
and pragmatically relevant categories in syntax, there is no longer a clean distinction be- 
tween the levels of morphology, syntax, semantics and pragmatics: everything has been 
‘syntactified’. Rizzi (2014) himself talks about syntactification. He points out that there 
are fundamental problems with the T-model and its current variants in Minimalism and 
concludes that a syntactification in terms of Rizzi-style functional heads or a prolifera- 
tion of T heads with respective features (see also Borsley 2006; Borsley & Müller 2019) 
is the only way to save this architecture. Felix Bildhauer (p. c. 2012) has pointed out to 
me that approaches which assume a cascade of functional projections where the indi- 
vidual aspects of meaning are represented by nodes are actually very close to phrasal 
approaches in Construction Grammar (see Adger 2013: 470 also for a similar view). One 


Ü'fhese structures do not correspond to X theory as it was presented in Section 2.5. In some cases, heads 
have been combined with complements to form an XP rather than an X’. For more on X theory in the 
Minimalist Program, see Section 4.6.3. 

“For further examples and references, see Newmeyer (2004a: 194; 2005: 82). Newmeyer references also 
works which stipulate a projection for each semantic role, e.g., Agent, Reciprocal, Benefactive, Instrumen- 
tal, Causative, Comitative, and Reversive Phrase. 
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CP 
c’ TopP 
DP; SubjP 
DP; ModP 
AdvP ObjP 
DP; NegP 
AdvP AspP 
AdvP MannP 
AdvP AuxP 
VP, Aux+ 
Aux  vP 
DP; VP, 
wahrscheinlich V DP, 
probably 
weil der Mann nicht oft gut gespielt hat 
because | the man not often well played has 
diese Sonate diese Sonate 
this sonata this sonata 


Figure 4.20: Analysis of sentence structure with leftward remnant movement and func- 
tional heads following Laenzlinger (2004: 224) 
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simply lists configurations and these are assigned a meaning (or features which are in- 
terpreted post-syntactically, see Cinque & Rizzi (2010: 62) for the interpretation of TopP, 
for example). 


4.6.1.2 Feature checking in specifier positions 


If one takes the theory of feature checking in Specifier-Head relations to its logical con- 
clusion, then one arrives at an analysis such as the one suggested by Radford (1997: 452). 
Radford assumes that prepositions are embedded in an Agreement Phrase in addition 
to the structure in (23), which is usually assumed, and that the preposition adjoins to 
the head of the Agreement Phrase and the argument of the preposition is moved to the 
specifier position of the Agreement Phrase. 


(23) [p PDP] 


The problem here is that the object now precedes the preposition. In order to rectify this, 
Radford assumes a functional projection p (read little p) with an empty head to which 
the preposition then adjoins. This analysis is shown in Figure 4.21. This machinery is 


pP 
Bere 
p AgrOP 
TS 


P AgrO P D 
| EE 
with 2 me t t t 


Figure 4.21: PP analysis following Radford with case assignment in specifier position and 
little p 


only necessary in order to retain the assumption that feature checking takes place in 
specifier-head relations. If one were to allow the preposition to determine the case of its 
object locally, then all this theoretical apparatus would not be necessary and it would be 
possible to retain the well-established structure in (23). 

Sternefeld (2006: 549—550) is critical of this analysis and compares it to Swiss cheese 
(being full of holes). The comparison to Swiss cheese is perhaps even too positive since, 
unlike Swiss cheese, the ratio of substance to holes in the analysis is extreme (2 words vs. 
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5 empty elements). We have already seen an analysis of noun phrases on page 70, where 
the structure of an NP, which only consisted of an adjective klugen ‘clever’, contained 
more empty elements than overt ones. The difference to the PP analysis discussed here is 
that empty elements are only postulated in positions where overt determiners and nouns 
actually occur. The little p projection, on the other hand, is motivated entirely theory- 
internally. There is no theory-external motivation for any of the additional assumptions 
made for the analysis in Figure 4.21 (see Sternefeld 2006: 549-550). 

A variant of this analysis has been proposed by Hornstein, Nunes & Grohmann (2005: 
124). The authors do without little p, which makes the structure less complex. They 
assume the structure in (24), which corresponds to the AgrOP-subtree in Figure 4.21. 


(24) [Ag DPx [ag Pit Agr [pp t; te ]]] 


The authors assume that the movement of the DP to SpecAgrP happens invisibly, that 
is, covert. This solves Radford’s problem and makes the assumption of pP redundant. 

The authors motivate this analysis by pointing out agreement phenomena in Hun- 
garian: Hungarian postpositions agree with the preceding noun phrase in person and 
number. That is, the authors argue that English prepositional and Hungarian postposi- 
tional phrases have the same structure derived by movement, albeit the movement is 
covert in English. 

In this way, it is possible to reduce the number and complexity of basic operations 
and, in this sense, the analysis is minimal. These structures are, however, still incredibly 
complex. No other kind of theory discussed in this book needs the amount of inflated 
structure to analyze the combination of a preposition with a noun phrase. The struc- 
ture in (24) cannot be motivated by reference to data from English and it is therefore 
impossible to acquire it from the linguistic input. A theory which assumes this kind 
of structures would have to postulate a Universal Grammar with the information that 
features can only be checked in (certain) specifier positions (see Chapters 13 and 16 for 
more on Universal Grammar and language acquisition). For general remarks on (covert) 
movement see Haider (2014: Section 2.3). 


4.6.1.3 Locality of selection and functional projections 


Another problem arises from the use of functional heads to encode linear order. In the 
classic CP/IP-system and all other theories discussed here, a category stands for a class of 
objects with the same distribution, that is, NP (or DP) stands for pronouns and complex 
noun phrases. Heads select phrases with a certain category. In the CP/IP-system, I selects 
a VP and an DP/NP, whereas C selects an IP. In newer analyses, this kind of selectional 
mechanism does not work as easily. Since movement has taken place in (25b), we are 
dealing with a TopP or FocP in das Buch dem Mann zu geben ‘the book the man to give’. 
Therefore, um cannot simply select an non-finite IP, but rather has to disjunctively be 
able to select a TopP, FocP or IP. It has to be ensured that TopPs and FocPs are marked 
with regard to the form of the verb contained inside them, since um can only be combined 
with zu-infinitives. 
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(25) a. um dem Mann das Buch zu geben 
for the man the book to give 


‘to give the man the book’ 


b. um das Buch dem Mann zu geben 
for the book the man to give 


‘to give the book to the man’ 


The category system, selectional mechanisms and projection of features would therefore 
have to be made considerably more complicated when compared to a system which 
simply base generates the orders or a system in which a constituent is moved out of the 
IP, thereby creating a new IP. 

Proposals that follow Cinque (1999) are problematic for similar reasons: Cinque as- 
sumes the category AdverbP for the combination of an adverb and a VP. There is an 
empty functional head, which takes the verbal projection as its complement and the ad- 
verb surfaces in the specifier of this projection. In these systems, adverb phrases have to 
pass on inflectional properties of the verb since verbs with particular inflectional proper- 
ties (finiteness, infinitives with zu, infinitives without zu, participles) have to be selected 
by higher heads (see page 183 and Section 9.1.4). There is of course the alternative to 
use Agree for this, but then all selection would be nonlocal and after all selection is not 
agreement. For further, more serious problems with this analysis like modification of ad- 
verbs by adverbs in connection with partial fronting and restrictions on non-phrasality 
of preverbal adverbials in English, see Haider (1997a: Section 5). 

A special case of the adverb problem is the negation problem: Ernst (1992) studied the 
syntax of negation more carefully and pointed out that negation can attach to several 
different verbal projections (26a,b), to adjectives (26c) and adverbs (26d). 


(26) a. Ken could not have heard the news. 
b. Ken could have not heard the news. 
c. a [not unapproachable] figure 


d. [Not always] has she seasoned the meat. 


If all of these projections are simply NegPs without any further properties (about verb 
form, adjective part of speech, adverb part of speech), it would be impossible to account 
for their different syntactic distributions. Negation is clearly just a special case of the 
more general problem, since adverbs may attach to adjectives forming adjectival phrases 
in the traditional sense and not adverb phrases in Chinque's sense. For instance, the 
adverb oft ‘often’ in (27) modifies lachender ‘laughing’ forming the adjectival phrase oft 
lachender, which behaves like the unmodified adjectival participle lachender: it modifies 
Mann ‘man’ and it precedes it. 


(27) a. ein lachender Mann 
a laughing man 


‘a laughing man’ 
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b. ein oft lachender Mann 
a often laughing man 


“a man that laughs often’ 


Of course one could imagine solutions to the last three problems that use the Agree 
relation to enforce selectional constraints nonlocally, but such accounts would violate 
locality of selection (see Ernst 1992: 110 and the discussion in Section 18.2 of this book) 
and would be much more complicated than accounts that assume a direct selection of 
dependents. 

Related to the locality issues that were discussed in the previous paragraph is the 
assumption of special functional projections for the placement of clitics: if one uses 
SpeakerP so that a clitic for first person singular can be moved to the correct specifier 
positions and a HearerP so that the clitic for second person can be moved to the cor- 
rect position (Poletto 2000: 31), then what one has are special projections which need to 
encode in addition all features that are relevant for clauses (alternatively one could of 
course assume nonlocal Agree to be responsible for distributional facts). In addition to 
these features, the category labels contain information that allows higher heads to select 
clauses containing clitics. In other approaches and earlier variants of transformational 
grammar, selection was assumed to be strictly local so that higher heads only have ac- 
cess to those properties of embedded categories that are directly relevant for selection 
(Abraham 2005: 223; Sag 2007) and not information about whether an argument of a 
head within the clause is the speaker or the hearer or whether some arguments in the 
clause are realized as clitics. Locality will be discussed further in Section 18.2. 


4.6.1.4 Feature-driven movement 


Finally, there is a conceptual problem with feature-driven movement, which has been 
pointed out by Gisbert Fanselow: Frey (2004b: 27) assumes a KontrP (contrastive phrase) 
and Frey (2004a) a TopP (topic phrase) (see Rizzi (1997) for TopP and FocP (focus phrase) 
in Italian and Haftka (1995), Grewendorf (2002: 85, 240); 2009, Abraham (2003: 19), Laen- 
zlinger (2004: 224) and Hinterhölzel (2004: 18) for analyses of German with TopP and/or 
FocP). Constituents have to move to the specifier of these functional heads depending on 
their information structural status. Fanselow (2003a) has shown that such movement- 
based theories for the ordering of elements in the middle field are not compatible with 
current assumptions of the Minimalist Program. The reason for this is that sometimes 
movement takes place in order to create space for other elements (altruistic movement). 
If the information structure of a sentence requires that the closest object to a verb is nei- 
ther focused nor part of the focus, then the object closest to the verb should not receive 
the main stress in the clause. This can be achieved by deaccentuation, that is, by moving 
the accent to another constituent or even, as shown in (28b), by moving the object to a 
different position from the one in which it receives structural stress. 


(28) a. dass die Polizei gestern Linguisten verhaftete 
that the police yesterday linguists arrested 


‘that the police arrested linguists yesterday’ 
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b. dass die Polizei Linguisten gestern verhaftete 
that the police linguists yesterday arrested 


“that the police arrested linguists yesterday’ 


In Spanish, partial focus can be achieved not by special intonation, but rather only by 
altruistic movement in order to move the object out of the focus. See also Bildhauer & 
Cook (2010: p. 72) for a discussion of “altruistic” multiple frontings in German. 

It is therefore not possible to assume that elements are moved to a particular position 
in the tree in order to check some feature motivated by information structural proper- 
ties. Since feature checking is a prerequisite for movement in current minimalist theory, 
one would have to postulate a special feature, which only has the function of triggering 
altruistic movement. Fanselow (2003a: Section 4; 2006: 8) has also shown that the order- 
ing constraints that one assumes for topic, focus and sentence adverbs can be adequately 
described by atheory which assumes firstly, that arguments are combined (in minimalist 
terminology: merged) with their head one after the other and secondly, that adjuncts can 
be adjoined to any projection level. The position of sentence adverbs directly before the 
focused portion of the sentence receives a semantic explanation: since sentence adverbs 
behave like focus-sensitive operators, they have to directly precede elements that they 
refer to. It follows from this that elements which do not belong to the focus of an utter- 
ance (topics) have to occur in front of the sentence adverb. It is therefore not necessary 
to assume a special topic position to explain local reorderings in the middle field. This 
analysis is also pursued in LFG and HPSG. The respective analyses are discussed in more 
detail in the corresponding chapters. 


4.6.2 Labeling 


In the Minimalist Program, Chomsky tries to keep combinatorial operations and mech- 
anisms as simple as possible. He motivates this with the assumption that the existence 
of a UG with less language-specific knowledge is more plausible from a evolutionary 
point of view than a UG which contains a high degree of language-specific knowledge 
(Chomsky 2008: 135). 

For this reason, he removes the projection levels of X theory, traces, indices and “sim- 
ilar descriptive technology” (Chomsky 2008: 138). All that remains is Merge and Move, 
that is, Internal and External Merge. Internal and External Merge combine two syntactic 
objects o and f into a larger syntactic object which is represented as a set { a, P }. æ 
and D can be either lexical items or internally complex syntactic objects. Internal Merge 
moves a part of an object to its periphery.” The result of internally merging an element 
is a set { a, B } where a was a part of f. External Merge also produces a set with two 
elements. However, two independent objects are merged. The objects that are created 
by Merge have a certain category (a set of features). For instance, if one combines the 
elements o and f, one gets (1, { a, B }}, where lis the category of the resulting object. 


"To be more specific, part of a syntactic object is copied and the copy is placed at the edge of the entire 
object. The original of this copy is no longer relevant for pronunciation (Copy Theory of Movement). 
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This category is also called a label. Since it is assumed that all constituents are headed, 
the category that is assigned to { a, f } has to be either the category of o or the category 
of f. Chomsky (2008: 145) discusses the following two rules for the determination of the 
label of a set. 


(29) a. In{H, œ}, Han lL, His the label. 


b. If a is internally merged to f, forming { a, f } then the label of f is the label 
of ( a, fp} 


As Chomsky notes, these rules are not unproblematic since the label is not uniquely de- 
termined in all cases. An example is the combination of two lexical elements. If both H 
and a in (29a) are lexical items (LI), then both H and a can be the label of the resulting 
structure. Chomsky notices that this could result in deviant structures, but claims that 
this concern is unproblematic and ignores it. Chomsky offered a treatment of the combi- 
nation of two lexical items in his 2013 paper. The solution to the problem is to assume that 
all combinations of lexical elements consist of a functional element and a root (Marantz 
1997; Borer 2005). Roots are not considered as labels per definition? and hence the cat- 
egory of the functional element determines the category of the combination (Chomsky 
2013: 47). Such an analysis can only be rejected: the goal of the Minimalist Program is to 
simplify the theoretical proposals to such an extent that the models of language acquisi- 
tion and language evolution become plausible, but in order to simplify basic concepts it 
is stipulated that a noun cannot simply be a noun but needs a functional element to tell 
the noun what category it has. Given that the whole point of Chomsky's Bare Phrase 
Structure (Chomsky 1995a) was the elimination of the unary branching structures in X 
theory, it is unclear why they are reintroduced now through the backdoor, only more 
complex with an additional empty element." Theories like Categorial Grammar and 
HPSG can combine lexical items directly without assuming any auxiliary projections or 
empty elements. See also Rauh (2016) for a comparison of the treatment of syntactic cate- 
gories in earlier versions of Transformational Grammar, HPSG, Construction Grammar, 
Role and Reference Grammar and root-based Neo-Constructivist proposals like the one 
assumed by Chomsky (2013). Rauh concludes that the direct connection of syntactic and 
semantic information is needed and that the Neo-Constructivism of Marantz and Borer 
has to be rejected. For further criticism of Neo-Constructivist approaches see Wechsler 
(2008a) and Müller & Wechsler (2014a: Sections 6.1 and 7). 

The combination of a pronoun with a verbal projection poses a problem that is related 
to what has been said above. In the analysis of He left, the pronoun he is a lexical element 


16 Another category that is excluded as label per definition is Conj, which stands for conjunction (Chomsky 
2013: 45-46). This is a stipulation that is needed to get coordination to work. See below. 
The old X rule in (i.a) corresponds to the binary combination in (i.b). 


(i) a NON 
b. N — N-func root 


In (i.a) a lexical noun is projected to an N’ and in (i.b), a root is combined with a functional nominal head 
into a nominal category. 
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and hence would be responsible for the label of He left, since left is an internally complex 
verbal projection in Minimalist theories. The result would be a nominal label rather than 
a verbal one. To circumvent this problem, Chomsky (2013: 46) assumes that he has a 
complex internal structure: ‘perhaps D-pro’, that is, he is (perhaps) composed out of an 
invisible determiner and a pronoun. 

The case in which two non-LIs are externally merged (for instance a nominal and a 
verbal phrase) is not discussed in Chomsky (2008). Chomsky (2013: 43-44) suggests that 
a phrase XP is irrelevant for the labeling of { XP, YP } if XP is moved (or rather copied 
in the Copy Theory of Movement) in a further step. Chomsky assumes that one of two 
phrases in an { XP, YP } combination has to move, since otherwise labeling would be 
impossible (p.12).^ The following coordination example will illustrate this: Chomsky 
assumes that the expression Z and W is analyzed as follows: first, Z and W are merged. 
This expression is combined with Conj (30a) and in the next step Z is raised (30b). 


(30) a. [e Conj [g ZW]] 
b. [, Z [e Conj le Z W]] 


Since Z in f is only a copy, it does not count for labeling and £ can get the label of W. It 
is stipulated for the combination of Z and a that Conj cannot be the label and hence the 
label of the complete structure is Z.” 

A special case that is discussed by Chomsky is the Internal Merge of an LI e with a 
non LI f. According to rule (29a) the label would be a. According to (29b), the label 
would be f (see also Donati (2006)). Chomsky discusses the combination of the pronoun 
what with you wrote as an example. 


(31) what [C [you wrote d 


If the label is determined according to (29b), one then has a syntactic object that would 
be called a CP in the GB framework; since this CP is, moreover, interrogative, it can 
function as the complement of wonder as in (32a). If the label is determined according to 


Wo explanation is contradictory: on p.11 Chomsky assumes that a label of a combination of two entities 
with the same category is this category. But in his treatment of coordination, he assumes that one of the 
conjuncts has to be raised, since otherwise the complete structure could not be labeled. 

1? As Bob Borsley (p.c. 2013) pointed out to me, this makes wrong predictions for coordinations oftwo singular 
noun phrases with and, since the result of the coordination is a plural DP and not a singular one like the 
first conjunct. Theories like HPSG can capture this by grouping features in bundles that can be shared in 
coordinated structures (syntactic features and nonlocal features, see Pollard & Sag (1994: 202)). 

Furthermore the whole account cannot explain why (i.b) is ruled out. 


(i) a. both Kim and Lee 
b. * both Kim or Lee 


The information about the conjunction has to be part of the representation for or Lee in order to be able to 
contrast it with and Lee. 

A further problem is that the label of a should be the label of W since Conj does not count for label 
determination. This would lead to a situation in which we have to choose between Z and W to determine 
the label of y. Following Chomsky's logic, either Z or W would have to move on to make it possible to 
label y. Chomsky (2013) mentions this problem in footnote 40, but does not provide a solution. 
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(29a), one gets an object that can function as the accusative object of read in (32b), that 
is, something that corresponds to a DP in GB terminology. 


(32) a. I wonder what you wrote. 


b. Iread what you wrote. 


what you wrote in (32b) is a so-called free relative clause. 

Chomsky's approach to free relative clauses is interesting but is unable to describe the 
phenomenon in full breadth. The problem is that the phrase that contains the relative 
pronoun may be complex (contrary to Donati's claims, see also Citko (2008: 930-932) 
for a rejection of Donati's claim).”° (33) provides an English example from Bresnan & 
Grimshaw (1978: 333). German examples from Bausewein (1990: 155) and Müller (1999a: 
78) are given in (34). 


(33) UU read [whichever book] you give me. 


(34) a. Ihr könnt beginnen, [mit wem] ihr wollt.” 
you can start with whom you want 


"You can start with whoever you like’ 


b. [Wessen Birne] noch halbwegs in der Fassung steckt, pflegt solcherlei 
whose bulb/head yet halfway in the socket is uses such 
Erloschene zu meiden;?? 


extinct to avoid 


"Ihose who still have their wits half way about them tend to avoid such vacant 
characters; 


c. [Wessen Schuhe] „danach“ besprenkelt sind, hat keinen Baum gefunden und 
whose shoes after.that speckled are has no tree found and 
war nicht zu einem Bogen in der Lage.” 
was not toa bow in the position 


‘Those whose shoes are spattered afterwards couldn’t find a tree and were 
incapable of peeing in an arc: 


Since wessen Schuhe ‘whose shoes’ is not a lexical item, rule (29b) has to be applied, 
provided no additional rules are assumed to deal with such cases. This means that the 
whole free relative clause wessen Schuhe danach besprenkelt sind is labeled as CP. For 
the free relatives in (33) and (34) the labeling as a CP is an unwanted result, since they 
function as subjects or objects of the matrix predicates and hence should be labelled 
DP. However, since wessen Schuhe is a complex phrase and not a lexical item, (29a) does 
not apply and hence there is no analysis of the free relative clause as a DP. Therefore, 


2° Chomsky (2013: 47) admits that there are many open questions as far as the labeling in free relative clauses 
is concerned and hence admits that there remain many open questions with labeling as such. 

?'Bausewein (1990: 155). 

?" Thomas Gsella, taz, 12.02.1997, p. 20. 

® taz, taz mag, 08./09.08.1998, p. XII. 
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it seems one must return to something like the GB analysis proposed by Groos & van 
Riemsdijk (1981), at least for the German examples. Gross and van Riemsdijk assume 
that free relatives consist of an empty noun that is modified by the relative clause like 
a normal noun. In such an approach, the complexity of the relative phrase is irrelevant. 
It is only the empty head that is relevant for labeling the whole phrase." However, 
once empty heads are countenanced in the analysis, the application of (29a) to (31) is 
undesirable since the application would result in two analyses for (32b): one with the 
empty nominal head and one in which (31) is labeled as DP directly. One might argue 
that in the case of several possible derivations, the most economical one wins, but the 
assumption of transderivational constraints leads to undesired consequences (Pullum 
2013: Section 5). 

Chomsky (2013) abandons the labeling condition in (29b) and replaces it with general 
labeling rules that hold for both internal and external Merge of two phrases. He dis- 
tinguishes two cases. In the first case, labeling becomes possible since one of the two 
phrases of the set { XP, YP } is moved away. This case was already discussed above. 
Chomsky writes about the other case: X and Y are identical in a relevant respect, provid- 


*4 Assuming an empty head is problematic since it may be used as an argument only in those cases in which 
it is modified by an adjunct, namely the relative clause (Müller 1999a: 97). See also Ott (2011: 187) for a 
later rediscovery of this problem. It can be solved in HPSG by assuming a unary projection that projects 
the appropriate category from a relative clause. I also use the unary projection to analyze so-called non- 
matching free relative clauses (Müller 1999a). In constructions with nonmatching free relative clauses, 
the relative clause fills an argument slot that does not correspond to the properties of the relative phrase 
(Bausewein 1990). Bausewein discusses the following example, in which the relative phrase is a PP but the 
free relative fills the accusative slot of kocht ‘cooks’. 


(i) Sie kocht, worauf sie Appetit hat. 
she cooks where.on she appetite has 


“She cooks what she feels like eating’ 


See Müller (1999a: 60-62) for corpus examples. 

Minimalist theories do not employ unary projections. Ott (2011) develops an analysis in which the 
category of the relative phrase is projected, but he does not have a solution for nonmatching free relative 
clauses (p. 187). The same is true for Citko’s analysis, in which an internally merged XP can provide the 
label. 

Many other proposals for labeling or, rather, non-labeling exist. For instance, some Minimalists want 
to eliminate labeling altogether and argue for a label-free syntax. As was pointed out by Osborne, Putnam 
& Groß (2011), such analyses bring Minimalism closer to Dependency Grammar. It is unclear how any of 
these models could deal with non-matching free relative clauses. Groß & Osborne (2009: Section 5.3.3) 
provide an analysis of free relatives in their version of Dependency Grammar, but deny the existence of 
nonmatching ones (p. 78). They suggest an analysis in which the relative phrase is the root/label of the free 
relative clause and hence they have the same problem as Minimalist proposals have with non-matching 
free relative clauses. As Groß & Osborne (2009: 73) and Osborne et al. (2011: 327) state: empty heads are 
usually not assumed in (their version of) Dependency Grammar. Neither are unary branching projections. 
This seems to make it impossible to state that free relative clauses with a relative phrase YP can function 
as XP, provided XP is a category that is higher in the obliqueness hierarchy of Keenan & Comrie (1977), a 
generalization that was discovered by Bausewein (1990) (see also Müller 1999a: 60-62 and Vogel 2001: 4). 
In order to be able to express the relevant facts, an element or a label has to exist that is different from the 
label of worauf in (i). 
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ing the same label, which can be taken as the label of the SO (p. 11). He sketches an analysis 
of interrogative clauses on p.13 in which the interrogative phrase has a Q feature and 
the remaining sentence from which the Q phrase was extracted has a Q feature as well. 
Since the two constituents share this property, the label of the complete clause will be 
Q. This kind of labeling will “perhaps” also be used for labeling normal sentences con- 
sisting of a subject and a verb phrase agreeing in person and number. These features 
would be responsible for the label of the sentence. The exact details are not worked out, 
but almost certainly will be more complex than (29b). 

A property that is inherent in both Chomsky (2005) and Chomsky (2013) is that the 
label is exclusively determined from one of the merged objects. As Bob Borsley pointed 
out to me, this is problematic for interrogative/relative phrases like (35). 


(35) with whom 


The phrase in (35) is both a prepositional phrase (because the first word is a prepo- 
sition) and an interrogative/relative phrase (because the second word is an interroga- 
tive/relative word). So, what is needed for the correct labeling of PPs like the one in (35) 
is a well-defined way of percolating different properties from daughters to the mother 
node.” 

For further problems concerning labeling and massive overgeneration by recent for- 
mulations of Merge see Fabregas et al. (2016). 

Summarizing, one can say that labeling, which was introduced to simplify the theory 
and reduce the amount of language specific innate knowledge that has to be assumed, 
can only be made to function with a considerable amount of stipulations. For instance, 
the combination of lexical elements requires the assumption of empty functional heads, 
whose only purpose is determining the syntactic category of a certain lexical element. 
If this corresponded to linguistic reality, knowledge about labeling, the respective func- 
tional categories, and information about those categories that have to be ignored for 
the labeling would have to be part of innate language specific knowledge and nothing 
would be gained. One would be left with bizarre analyses with an enormous degree 
of complexity without having made progress in the Minimalist direction. Furthermore, 
there are empirical problems and a large number of unsolved cases. 


25HPSG solves this problem by distinguishing head features including part of speech information and non- 
local features containing information about extraction and interrogative/relative elements. Head features 
are projected from the head, the nonlocal features of a mother node are the union of the nonlocal features 
of the daughters minus those that are bound off by certain heads or in certain configurations. 

Citko (2008: 926) suggests an analysis in which both daughters can contribute to the mother node. The 
result is a complex label like { P, { D, N }}. This is a highly complex data structure and Citko does not provide 
any information on how the relevant information that it contains is accessed. Is an object with the label 
{P,{D,N}}aP, aD or an N? One could say that P has priority since it is in the least embedded set, but D 
and N are in one set. What about conflicting features? How does a preposition that selects for a DP decide 
whether ( D, N} is a D or an N? In any case it is clear that a formalization will involve recursive relations 
that dig out elements of subsets in order to access their features. This adds to the overall complexity of 
the proposal and is clearly dispreferred over the HPSG solution, which uses one part of speech value per 
linguistic object. 
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The conclusion is that the label of a binary combination should not be determined 
in the ways suggested by Chomsky (2008; 2013). An alternative option for computing 
the label is to use the functor of a functor argument structure as the label (Berwick & 
Epstein 1995: 145). This is the approach taken by Categorial Grammar (Ajdukiewicz 1935; 
Steedman 2000) and in Stabler's Minimalist Grammars (2011b).”° Stabler’s formalization 
of Merge will be discussed in Section 4.6.4. 


4.6.3 Specifiers, complements, and the remains of X theory 


Chomsky (2008: 146) assumes that every head has exactly one complement but an arbi- 
trary number of specifiers. In standard X theory, the restriction that there can be at most 
one complement followed from the general X schema and the assumption that structures 
are at most binary branching: in standard X theory a lexical head was combined with 
all its complements to form an X’. If there are at most two daughters in a phrase, it 
follows that there can be only one complement (Sentences with ditransitive verbs have 
been analyzed with an empty head licensing an additional argument; see Larson (1988) 
for the suggestion of an empty verbal head and Miiller & Wechsler (2014a: Sections 6.1 
and 7) for a critical assessment of approaches involving little v). In standard X theory 
there was just one specifier. This restriction has now been abandoned. Chomsky writes 
that the distinction between specifier and complement can now be derived from the or- 
der in which elements are merged with their head: elements that are first-merged are 
complements and all others - those which are later-merged - are specifiers. 

Such an approach is problematic for sentences with monovalent verbs: according to 
Chomsky’s proposal, subjects of monovalent verbs would not be specifiers but comple- 
ments." This problem will be discussed in more detail in Section 4.6.4. 


26For the Categorial Grammar approach to work, it is necessary to assign the category x/x to an adjunct, 
where x stands for the category of the head to which the adjunct attaches. For instance, an adjective 
combines with a nominal object to form a nominal object. Therefore its category is n/n rather than adj. 

Similarly, Stabler’s approach does not extend to adjuncts unless he is willing to assign the category 

noun to attributive adjectives. One way out of this problem is to assume a special combination operation 
for adjuncts and their heads (see Frey & Gartner 2002: Section 3.2). Such a combination operation is 
equivalent to the Head-Adjunct Schema of HPSG. 

27Pauline Jacobson (p.c. 2013) pointed out that the problem with intransitive verbs could be solved by assum- 
ing that the last-merged element is the specifier and all non-last-merged elements are complements. This 
would solve the problems with intransitive verbs and with the coordination of verbs in (36) but it would 
not solve the problem of coordination in head-final languages as in (39). Furthermore, current Minimalist 
approaches make use of multiple specifiers and this would be incompatible with the Jacobsonian proposal 
unless one would be willing to state more complicated restrictions on the status of non-first-merged ele- 
ments. 
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Apart from this, theories assuming that syntactic objects merged with word groups are 
specifiers do not allow for analyses in which two lexical verbs are directly coordinated, 
as in (36): 


(36) He [knows and loves] this record. 


For example, in an analysis suggested by Steedman (1991: 264), and (being the head) 
is first merged with loves and then the result is merged with knows. The result of this 
combination is a complex object that has the same syntactic properties as the combined 
parts: the result is a complex verb that needs a subject and an object. After the combi- 
nation of the conjunction with the two verbs, the result has to be combined with this 
record and he. this record behaves in all relevant respects like a complement. Follow- 
ing Chomsky’s definition, however, it should be a specifier, since it is combined with 
the third application of Merge. The consequences are unclear. Chomsky assumes that 
Merge does not specify constituent order. According to him, the linearization happens 
at the level of Phonological Form (PF). The restrictions that hold there are not described 
in his recent papers. However, if the categorization as complement or specifier plays a 
role for linearization as in Kayne’s work (2011: 2, 12) and in Stabler’s proposal (see Sec- 
tion 4.6.4), this record would have to be serialized before knows and loves, contrary to the 
facts. This means that a Categorial Grammar-like analysis of coordination is not viable 
and the only remaining option would seem to assume that knows is combined with an 
object and then two VPs are coordinated. Kayne (1994: 61, 67) follows Wexler & Culi- 
cover (1980: 303) in suggesting such an analysis and assumes that the object in the first 
VP is deleted. However, Borsley (2005: 471) shows that such an analysis makes wrong 
predictions, since (37a) would be derived from (37b) although these sentences differ in 
meaning.”” 


(37) a. Hobbs whistled and hummed the same tune. 


b. Hobbs whistled the same tune and hummed the same tune. 


28 Chomsky (2013: 46) suggests the coordination analysis in (30): according to this analysis, the verbs would 
be merged directly and one of the verbs would be moved around the conjunction in a later step of the 
derivation. As was mentioned in the previous section, such analyses do not contribute to the goal of 
making minimal assumptions about innate language specific knowledge since it is absolutely unclear how 
such an analysis of coordination would be acquired by language learners. Hence, I will not consider this 
coordination analysis here. 

Another innovation of Chomsky’s 2013 paper is that he eliminates the concept of specifier. He writes 
in footnote 27 on page 43: There is a large and instructive literature on problems with Specifiers, but if the 
reasoning here is correct, they do not exist and the problems are unformulable. This is correct, but this also 
means that everything that was explained with reference to the notion of specifier in the Minimalist frame- 
work until now does not have an explanation any longer. If one follows Chomsky’s suggestion, a large 
part of the linguistic research of the past years becomes worthless and has to be redone. 

Chomsky did not commit himself to a particular view on linearization in his earlier work, but somehow 
one has to ensure that the entities that were called specifier are realized in a position in which constituents 
are realized that used to be called specifier. This means that the following remarks will be relevant even 
under current Chomskyan assumptions. 

2° See also Bartsch & Vennemann (1972: 102), Jackendoff (1977: 192-193), Dowty (1979: 143), den Besten (1983: 
104-105), Klein (1985: 8-9) and Eisenberg (1994b) for similar observations and criticism of similar proposals 
in earlier versions of Transformational Grammar. 
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Since semantic interpretation cannot see processes such as deletion that happen at the 
level of Phonological Form (Chomsky 1995b: Chapter 3), the differences in meaning can- 
not be explained by an analysis that deletes material. 

In a further variant of the VP coordination analysis, there is a trace that is related to 
this record. This would be a Right-Node-Raising analysis. Borsley (2005) has shown that 
such analyses are problematic. Among the problematic examples that he discusses is the 
following pair (see also Bresnan 1974: 615). 


(38) a. He tried to persuade and convince him. 


b. *He tried to persuade, but couldn't convince, him. 


The second example is ungrammatical if him is not stressed. In contrast, (38a) is well- 
formed even with unstressed him. So, if (38a) were an instance of Right-Node-Raising, 
the contrast would be unexpected. Borsley therefore excludes a Right-Node-Raising 
analysis. 

The third possibility to analyze sentences like (36) assumes discontinuous constituents 
and uses material twice: the two VPs knows this record and loves this record are coor- 
dinated with the first VP being discontinuous. (See Crysmann (2001) and Beavers & 
Sag (2004) for such proposals in the framework of HPSG.) However, discontinuous con- 
stituents are not usually assumed in the Minimalist framework (see for instance Kayne 
(1994: 67)). Furthermore, Abeillé (2006) showed that there is evidence for structures in 
which lexical elements are coordinated directly. This means that one needs analyses like 
the CG analysis discussed above, which would result in the problems with the speci- 
fier/complement status just discussed. 

Furthermore, Abeillé has pointed out that NP/DP coordinations in head-final lan- 
guages like Korean and Japanese present difficulties for Merge-based analyses. (39) 
shows a Japanese example. 


(39) Robin-to Kim 
Robin-and Kim 


*Kim and Robin' 


In the first step Robin is merged with to. In a second step Kim is merged. Since Kim is 
a specifier, one would expect that Kim is serialized before the head as it is the case for 
other specifiers in head-final languages. 

Chomsky tries to get rid ofthe unary branching structures of standard X theory, which 
were needed to project lexical items like pronouns and determiners into full phrases, 
referring to work by Muysken (1982). Muysken used the binary features MIN and MAx to 
classify syntactic objects as minimal (words or word-like complex objects) or maximal 
(syntactic objects that stand for complete phrases). Such a feature system can be used 
to describe pronouns and determiners as [+MIN, +MAx]. Verbs like give, however, are 
classified as [+Mın, —MAx]. They have to project in order to reach the [^ MAx]-level. If 
specifiers and complements are required to be [+MAx], then determiners and pronouns 
fulfill this requirement without having to project from X via X’ to the XP-level. 
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In Chomsky’s system, the MIN/MAX distinction is captured with respect to the com- 
pleteness of heads (complete = phrase) and to the property of being a lexical item. How- 
ever, there is a small but important difference between Muysken’s and Chomsky’s pro- 
posal: the predictions with regard to the coordination data that was discussed above. 
Within the category system of X theory, it is possible to combine two X^ to get a new, 
complex X?. This new object has basically the same syntactic properties that simple X°s 
have (see Jackendoff 1977: 51 and Gazdar, Klein, Pullum & Sag 1985). In Muysken's sys- 
tem, the coordination rule (or the lexical item for the conjunction) can be formulated 
such that the coordination of two +MIN items is a +MIN item. In Chomsky's system an 
analogous rule cannot be defined, since the coordination of two lexical items is not a 
lexical item any longer. 

Like Chomsky in his recent Minimalist work, Categorial Grammar (Ajdukiewicz 1935) 
and HPSG (Pollard and Sag 1987; 1994: 39—40) do not (strictly) adhere to X theory. Both 
theories assign the symbol NP to pronouns (for CG see Steedman & Baldridge (2006: 
p. 615), see Steedman (2000: Section 4.4) for the incorporation of lexical type raising in 
order to accommodate quantification). The phrase likes Mary and the word sleeps have 
the same category in Categorial Grammar (s\np). In both theories it is not necessary to 
projecta noun like tree from N? to N in order to be able to combine it with a determiner or 
an adjunct. Determiners and monovalent verbs in controlled infinitives are not projected 
from an X? level to the XP level in many HPSG analyses, since the valence properties 
of the respective linguistic objects (an empty sUBCAT or CoMBS list) are sufficient to de- 
termine their combinatoric potential and hence their distribution (Müller 1996d; Müller 
1999b). If the property of being minimal is needed for the description of a phenomenon, 
the binary feature LEX is used in HPSG (Pollard and Sag 1987: 172; 1994: 22). However, 
this feature is not needed for the distinction between specifiers and complements. This 
distinction is governed by principles that map elements of an argument structure list 
(ARG-ST) onto valence lists that are the value of the sPECIFIER and the COMPLEMENTS fea- 
ture (abbreviated as spr and comps respectively)? Roughly speaking, the specifier in a 
verbal projection is the least oblique argument of the verb for configurational languages 
like English. Since the argument structure list is ordered according to the obliqueness 
hierarchy of Keenan & Comrie (1977), the first element of this list is the least oblique 
argument of a verb and this argument is mapped to the spr list. The element in the spr 
list is realized to the left of the verb in SVO languages like English. The elements in the 
COMPS list are realized to the right of their head. Approaches like the one by Ginzburg & 
Sag (2000: 34, 364) that assume that head-complement phrases combine a word with its 
arguments have the same problem with coordinations like (36) since the head of the VP 
is not a word?! However, this restriction for the head can be replaced by one that refers 


Some authors assume a three-way distinction between subjects, specifiers, and complements. 

31s mentioned above, a multidomination approach with discontinuous constituents is a possible solution 
for the analysis of (36) (see Crysmann 2001 and Beavers & Sag 2004). However, the coordination of lexical 
items has to be possible in principle as Abeillé (2006) has argued. Note also that the HPSG approach 
to coordination cannot be taken over to the MP. The reason is that the HPSG proposals involve special 
grammar rules for coordination and MP comes with the claim that there is only Merge. Hence the additional 
introduction of combinatorial rules is not an option within the MP. 
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to the LEX feature rather than to the property of being a word or lexical item. 

Pollard & Sag as well as Sag & Ginzburg assume flat structures for English. Since one 
of the daughters is marked as lexical, it follows that the rule does not combine a head 
with a subset of its complements and then apply a second time to combine the result 
with further complements. Therefore, a structure like (40a) is excluded, since gave John 
is not a word and hence cannot be used as the head daughter in the rule. 


(40) a. [[gave John] a book] 
b. [gave John a book] 


Instead of (40a), only analyses like (40b) are admitted; that is, the head is combined 
with all its arguments all in one go. The alternative is to assume binary branching struc- 
tures (Müller 2015a; Müller & Orsnes 2015: Section 1.2.2). In such an approach, the head 
complement schema does not restrict the word/phrase status of the head daughter. The 
binary branching structures in HPSG correspond to External Merge in the MP. 

In the previous two sections, certain shortcomings of Chomsky's labeling definition 
and problems with the coordination of lexical items were discussed. In the following 
section, I discuss Stabler's definition of Merge in Minimalist Grammar, which is explicit 
about labeling and in one version does not have the problems discussed above. I will 
show that his formalization corresponds rather directly to HPSG representations. 


4.6.4 Minimalism, Categorial Grammar, and HPSG 


In this section, I will relate Minimalism, Categorial Grammar and HPSG to one another. 
Readers who are not yet familiar with Categorial Grammar and HPSG should skim this 
section or consult the Chapters 6, 8 and 9 and return here afterwards. 

In Section 4.6.2, it was shown that Chomsky's papers leave many crucial details about 
labeling unspecified. Stabler's work is relatively close to recent Minimalist approaches, 
but is worked out much more precisely (see also Stabler (2011a: 397, 399, 400) on formal- 
ization of post GB approaches). Stabler (2001) shows how Kayne's theory of remnant 
movement can be formalized and implemented. Stabler refers to his particular way of 
formalizing Minimalist theories as Minimalist Grammars (MG). There are a number of in- 
teresting results with regard to the weak capacity of Minimalist Grammars and variants 
thereof (Michaelis 2001). It has been shown, for instance, that the number of possible 
languages one could create with MGs includes the set of those which can be created by 
Tree Adjoining Grammars (see Chapter 12). This means that it is possible to assign a 
greater number of word strings to structures with MGs, however, the structures derived 
by MGs are not necessarily always the same as the structures created by TAGs. For more 
on the generative capacity of grammars, see Chapter 17. 

Although Stabler's work can be regarded as a formalization of Chomsky's Minimalist 
ideas, Stabler's approach differs from Chomsky's in certain matters of detail. Stabler 
assumes that the results of the two Merge operations are not sets but pairs. The head 
in a pair is marked by a pointer (‘<’ or ‘>’). Bracketed expressions like { a, { a, 6 }} 
(discussed in Section 4.6.2) are replaced by trees like the one in (41). 
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(41) 


1 2 


1 is the head in (41), 2 is the complement and 3 the specifier. The pointer points to the 
part of the structure that contains the head. The daughters in a tree are ordered, that is, 
3 is serialized before 1 and 1 before 2. 

Stabler (2011a: 402) defines External Merge as follows: 


/N df t, has exactly 1 node 
(42) em(t[-f], ty[f]) = 
or otherwise 


t t 


=f is a selection feature and f the corresponding category. When t,[=f] and t,[f] are 
combined, the result is a tree in which the selection feature of t; and the respective 
category feature of t, are deleted. The upper tree in (42) represents the combination of 
a (lexical) head with its complement. t; is positioned before t,. The condition that t; has 
to have exactly one node corresponds to Chomsky's assumption that the first Merge is 
a Merge with a complement and that all further applications of Merge are Merges with 
specifiers (Chomsky 2008: 146). 

Stabler defines Internal Merge as follows: 


(43) im(t,[+f]) = 


32 


> 


sf es 


ty ttt- e> e 


t; is a tree with a subtree t, which has the feature f with the value ‘~’. This subtree is 
deleted (t,[-f]” — el and a copy of the deleted subtree without the —f feature (t7 ) is 
positioned in specifier position. The element in specifier position has to be a maximal 
projection. This requirement is visualized by the raised ‘>’. 

Stabler provides an example derivation for the sentence in (44). 


(44) who Marie praises 


321 addition to what is shown in (43), Stabler’s definition contains a variant of the Shortest Move Constraint 
(SMC), which is irrelevant for the discussion at hand and hence will be omitted. 
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praises is a two-place verb with two =D features. This encodes the selection of two de- 
terminer phrases. who and Marie are two Ds and they fill the object and subject position 
of the verb. The resulting verbal projection Marie praises who is embedded under an 
empty complementizer which is specified as +wu and hence provides the position for 
the movement of who, which is placed in the specifier position of CP by the application 
of Internal Merge. The -wu feature of who is deleted and the result of the application of 
Internal Merge is who Marie praises. 

This analysis has a problem that was pointed out by Stabler himself in unpublished 
work cited by Veenstra (1998: 124): it makes incorrect predictions in the case of mono- 
valent verbs. If a verb is combined with an DP, the definition of External Merge in (42) 
treats this DP as a complement? and serializes it to the right of the head. Instead of 
analyses of sentences like (45a) one gets analyses of strings like (45b). 


(45 a. Max sleeps. 
b. *Sleeps Max. 


To solve this problem, Stabler assumes that monovalent verbs are combined with a 
nonovert object (see Veenstra (1998: 61, 124) who, quoting Stabler's unpublished work, 
also adopts this solution). With such an empty object, the resulting structure contains 
the empty object as a complement. The empty object is serialized to the right of the verb 
and Max is the specifier and hence serialized to the left of the verb as in (46). 


(46) Maxsleeps . 


Of course, any analysis of this kind is both stipulative and entirely ad hoc, being moti- 
vated only by the wish to have uniform structures. Moreover, it exemplifies precisely 
one of the methodological deficiencies of Transformational Generative Grammar dis- 
cussed at length by Culicover & Jackendoff (2005: Section 2.1.2): the excessive appeal to 
uniformity. 

An alternative is to assume an empty verbal head that takes sleeps as complement and 
Max as subject. Such an analysis is often assumed for ditransitive verbs in Minimalist 
theories which assume Larsonian verb shells (Larson 1988). Larsonian analyses usually 
assume that there is an empty verbal head that is called little v and that contributes a 
causative meaning. As was discussed in Section 4.1.4, Adger (2003) adopts a little v-based 
analysis for intransitive verbs. Omitting the TP projection, his analysis is provided in 
Figure 4.22 on the next page. Adger argues that the analysis of sentences with unergative 
verbs involves a little v that selects an agent, while the analysis of unaccusative verbs 
involves a little v that does not select an N head. For unaccusatives, he assumes that 
the verb selects a theme. He states that little v does not necessarily have a causative 
meaning but introduces the agent. But note that in the example at hand the subject of 
sleep is neither causing an event, nor is it necessarily deliberately doing something. So 
it is rather an undergoer than an agent. This means that the assumption of the empty v 
head is made for purely theory-internal reasons without any semantic motivation in the 


S Compare also Chomsky's definition of specifier and complement in Section 4.6.3. 
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Max o 


v sleep 


Figure 4.22: Little v-based analysis of Max sleeps 


case of intransitives. If the causative contribution of little v in ditransitive constructions 
is assumed, this would mean that one needs two little vs, one with and one without a 
causative meaning. In addition to the lack of theory-external motivation for little v, there 
are also empirical problems for such analyses (for instance with coordination data). The 
reader is referred to Müller & Wechsler (2014a: Sections 6.1 and 7) for further details. 

Apart from the two operations that were defined in (42) and (43), there are no other 
operations in MG.** Apart from the problems with monovalent verbs, this results in the 
problem that was discussed in Section 4.6.3: there is no analysis with a direct combina- 
tion of verbs for (36) — repeated here as (47). 


(47) He [knows and loves] this record. 


The reason is that the combination of knows, and and loves consists of three nodes and 
the Merge of knows and loves with this record would make this record the specifier of 
the structure. Therefore this record would be serialized before knows and loves, contrary 
to the facts. Since the set of languages that can be generated with MGs contains the 
languages that can be generated with certain TAGs and with Combinatorial Categorial 
Grammar (Michaelis 2001), the existence of a Categorial Grammar analysis implies that 
the coordination examples can be derived in MGs somehow. But for linguists, the fact 
that it is possible to generate a certain string at all (the weak capacity of a grammar) is 
of less significance. It is the actual structures that are licensed by the grammar that are 
important (the strong capacity). 


4.6.4.1 Directional Minimalist Grammars and Categorial Grammar 


Apart from reintroducing X^ categories, the coordination problem can be solved by 
changing the definition of Merge in a way that allows heads to specify the direction 
of combination with their arguments: Stabler (2011b: p. 635) suggests marking the posi- 
tion of an argument relative to its head together with the selection feature and gives the 
following redefinition of External Merge. 


Y For extensions see Frey & Gartner (2002: Section 3.2). 
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EN ifa is x= 


to t 


The position of the equal sign specifies on which side of the head an argument has to 
be realized. This corresponds to forward and backward Application in Categorial Gram- 
mar (see Section 8.1.1). Stabler calls this form of grammar Directional MG (DMG). This 
variant of MG avoids the problem with monovalent verbs and the coordination data is 
unproblematic as well if one assumes that the conjunction is a head with a variable cat- 
egory that selects for elements of the same category to the left and to the right of itself. 
know and love would both select an object to the right and a subject to the left and this 
requirement would be transferred to knows and loves.” See Steedman (1991: 264) for 
the details of the CG analysis and Bouma & van Noord (1998: 52) for an earlier HPSG 
proposal involving directionality features along the lines suggested by Stabler for his 
DMGs. 


4.6.4.2 Minimalist Grammars and Head-Driven Phrase Structure Grammar 


The notation for marking the head of a structure with ‘>’ and ‘<’ corresponds directly 
to the HPSG representation of heads. Since HPSG is a sign-based theory, information 
about all relevant linguistic levels is represented in descriptions (phonology, morphol- 
ogy, syntax, semantics, information structure). (49) gives an example: the lexical item 
for the word grammar. 


word 
PHON ( gramor ) 
loc 
cat 
(49) CAT  |HEAD noun 
SYNSEM|LOC spp (DET) 
grammar 
CONT ... 
INST X 


The part of speech of grammar is noun. In order to form a complete phrase, it requires 
a determiner. This is represented by giving the spr feature the value ( DET ). Semantic 
information is listed under conT. For details see Chapter 9. 


35Note however, that this transfer makes it necessary to select complex categories, a fact that I overlooked in 
Müller (2013c). The selection of simplex features vs. complex categories will be discussed in Section 4.6.5. 
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Since we are dealing with syntactic aspects exclusively, only a subset of the used fea- 
tures is relevant: valence information and information about part of speech and certain 
morphosyntactic properties that are relevant for the external distribution of a phrase is 
represented in a feature description under the path svNsEM|roc|car. The features that 
are particularly interesting here are the so-called head features. Head features are shared 
between a lexical head and its maximal projection. The head features are located inside 
CAT and are grouped together under the path HEAD. Complex hierarchical structure is 
also modelled with feature value pairs. The constituents of a complex linguistic object 
are usually represented as parts of the representation of the complete object. For in- 
stance, there is a feature HEAD-DAUGHTER the value of which is a feature structure that 
models a linguistic object that contains the head of a phrase. The Head Feature Principle 
(50) refers to this daughter and ensures that the head features of the head daughter are 
identical with the head features of the mother node, that is, they are identical to the head 
features of the complete object. 


SYNSEM|LOC|CAT|HEAD 
HEAD-DTR|SYNSEM|LOC|CAT|HEAD 


e" 


(50) headed-phrase > 


fer 


Identity is represented by boxes with the same number. 

Ginzburg & Sag (2000: 30) represent all daughters of a linguistic object in a list that is 
given as the value ofthe DAUGHTERS attribute. The value ofthe feature HEAD-DAUGHTER 
is identified with one of the elements of the DAUGHTERS list: 


HEAD-DTR |1 
DTRS ( 


(51) 


fer 


a, a) 


HEAD-DTR [1 
DTRS (a, p) 

a and f are shorthands for descriptions of linguistic objects. The important point about 
the two descriptions in (51) is that the head daughter is identical to one of the two daugh- 
ters, which is indicated by the [1] in front of a and f, respectively. In the first feature 
description, the first daughter is the head and in the second description, the second 
daughter is the head. Because of the Head Feature Principle, the syntactic properties of 
the whole phrase are determined by the head daughter. That is, the syntactic properties 
of the head daughter correspond to the label in Chomsky's definition. This notation cor- 
responds exactly to the one that is used by Stabler: (51a) is equivalent to (52a) and (51b) 
is equivalent to (52b). 


un 


(52) a. b. 


< > 


IN AN 
a fp a p 
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An alternative structuring of this basic information, discussed by Pollard & Sag (1994: 
Chapter 9), uses the two features HEAD-DAUGHTER and NON-HEAD-DAUGHTERS rather 
than HEAD-DAUGHTER and DAUGHTERS. This gives rise to feature descriptions like (53a), 
which corresponds directly to Chomsky’s set-based representations, discussed in Sec- 
tion 4.6.2 and repeated here as (53b). 


HEAD-DTR a 
NON-HEAD-DTRS ( p) 


(53) «| 
b. {a (o, pj 


The representation in (53a) does not contain information about linear precedence of o 
and f. Linear precedence of constituents is constrained by linear precedence rules, which 
are represented independently from constraints regarding (immediate) dominance. 

The definition of Internal Merge in (43) corresponds to the Head-Filler Schema in 
HPSG (Pollard & Sag 1994: 164). Stabler's derivational rule deletes the subtree t,[-f]”. 
HPSG is monotonic, that is, nothing is deleted in structures that are licensed by a gram- 
mar. Instead of deleting t, inside of a larger structure, structures containing an empty 
element (NB - not a tree) are licensed directly.” Both in Stabler’s definition and in the 
HPSG schema, t, is realized as filler in the structure. In Stabler’s definition of Internal 
Merge, the category of the head daughter is not mentioned, but Pollard & Sag (1994: 164) 
restrict the head daughter to be a finite verbal projection. Chomsky (2007: 17) assumes 
that all operations but External Merge operate on phase level. Chomsky assumes that 
CP and v*P are phases. If this constraint is incorporated into the definition in (43), the re- 
strictions on the label oft, would have to be extended accordingly. In HPSG, sentences 
like (54) have been treated as VPs, not as CPs and hence Pollard & Sag’s requirement 
that the head daughter in the Head Filler Schema be verbal corresponds to Chomsky’s 
restriction. 


(54) Bagels, I like. 


Hence, despite minor presentational differences, we may conclude that the formalization 
of Internal Merge and that of the Head-Filler Schema are very similar. 

An important difference between HPSG and Stabler’s definition is that ‘movement’ is 
not feature driven in HPSG. This is an important advantage since feature-driven move- 
ment cannot deal with instances of so-called altruistic movement (Fanselow 2003a), that 
is, movement of a constituent that happens in order to make room for another con- 
stituent in a certain position (see Section 4.6.1.4). 

A further difference between general X theory and Stabler’s formalization of Internal 
Merge on the one hand and HPSG on the other is that in the latter case there is no 
restriction regarding the completeness (or valence ‘saturation’) of the filler daughter. 
Whether the filler daughter has to be a maximal projection (English) or not (German), 


36See Bouma, Malouf & Sag (2001) for a traceless analysis of extraction in HPSG and Miiller (2017a: Chapter 7) 
and Chapter 19 of this book for a general discussion of empty elements. 
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follows from restrictions that are enforced locally when the trace is combined with its 
head. This makes it possible to analyze sentences like (55) without remnant movement." 


(55) Gelesen, hat, das Buch keiner 
read has the book nobody 


apap 


In contrast, Stabler is forced to assume an analysis like the one in (56b) (see also G. Miiller 
(1998) for a remnant movement analysis). In a first step, das Buch is moved out of the VP 
(56a) and in a second step, the emptied VP is fronted as in (56b). 


(56) a. Hat [das Buch]; [keiner [yp _; gelesen]]. 
b. [yp _; Gelesen]; hat [das Buch]; [keiner _;]. 


Haider (1993: 281), De Kuthy & Meurers (2001: Section 2) and Fanselow (2002) showed 
that this kind of remnant movement analysis is problematic for German. The only phe- 
nomenon that Fanselow identified as requiring a remnant movement analysis is the prob- 
lem of multiple fronting (see Müller (2003a) for an extensive discussion of relevant data). 
Müller (2005b,c; 2017a) develops an alternative analysis of these multiple frontings which 
uses an empty verbal head in the Vorfeld, but does not assume that adjuncts or arguments 
like das Buch in (56b) are extracted from the Vorfeld constituent. Instead of the remnant 
movement analysis, the mechanism of argument composition from Categorial Grammar 
(Geach 1970; Hinrichs & Nakazawa 1994) is used to ensure the proper realization of argu- 
ments in the sentence. Chomsky (2007: 20) already uses argument composition as part 
of his analysis of TPs and CPs. Hence both remnant movement and argument composi- 
tion are assumed in recent Minimalist proposals. The HPSG alternative, however, would 
appear to need less theoretical apparatus and hence has to be preferred for reasons of 
parsimony. 

Finally, it should be mentioned that all transformational accounts have problems with 
Across the Board extraction like (57a) and (57) in which one element corresponds to 
several gaps. 


(57) a. Bagels, I like and Ellison hates.” 
b. The man who; [Mary loves | ;] and [Sally hates _;] computed my tax. 


This problem was solved for GPSG by Gazdar (1981b) and the solution carries over to 
HPSG. The Minimalist community tried to address these problems by introducing oper- 
ations like sideward movement (Nunes 2004) where constituents can be inserted into 
sister trees. So in the example in (57a), Bagels is copied from the object position of hates 
into the object position of like and then these two copies are related to the fronted el- 
ement. Kobele criticized such solutions since they overgenerate massively and need 
complicated filters. What he suggests instead is the introduction of a GPSG-style sLasH 
mechanism into Minimalist theories (Kobele 2008). 


37See also Müller & Orsnes (2013b) for an analysis of object shift in Danish that can account for verb fronting 
without remnant movement. The analysis does not have any of the problems that remnant movement 
analyses have. 

35bollard & Sag (1994: 205). 
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Furthermore, movement paradoxes (Bresnan 2001: Chapter 2) can be avoided by not 
sharing all information between filler and gap, a solution that is not available for trans- 
formational accounts, which usually assume identity of filler and gap or - as under the 
Copy Theory of Movement - assume that a derivation contains multiple copies of one 
object only one of which is spelled out. See also Borsley (2012) for further puzzles for, 
and problems of, movement-based approaches. 

A further difference between MG and HPSG is that the Head-Filler Schema is not the 
only schema for analyzing long-distance dependencies. As was noted in footnote 12 on 
page 147, there is dislocation to the right (extraposition) as well as fronting. Although 
these should certainly be analyzed as long-distance dependencies, they differ from other 
long-distance dependencies in various respects (see Section 13.1.5). For analyses of extra- 
position in the HPSG framework, see Keller (1995), Bouma (1996), and Miiller (1999b). 

Apart from the schema for long-distance dependencies, there are of course other sche- 
mata in HPSG which are not present in MG or Minimalism. These are schemata which 
describe constructions without heads or are necessary to capture the distributional prop- 
erties of parts of constructions, which cannot be easily captured in lexical analyses (e.g., 
the distribution of wh- and relative pronouns). See Section 21.10. 

Chomsky (2010) has compared a Merge-based analysis of auxiliary inversion to a 
HPSG analysis and critiqued that the HPSG analysis uses ten schemata rather than one 
(Merge). Ginzburg & Sag (2000) distinguish three types of construction with moved aux- 
iliaries: inverted sentences such as those with fronted adverbial and with wh-questions 
(58a,b), inverted exclamatives (58c) and polar interrogatives (58d): 


(58) Under no circumstances did she think they would do that. 


a. 
b. Whose book are you reading? 
c. AmI tired! 

d. Did Kim leave? 


Fillmore (1999) captures various different usage contexts in his Construction Grammar 
analysis of auxiliary inversion and shows that there are semantic and pragmatic differ- 
ences between the various contexts. Every theory must be able to account for these. 
Furthermore, one does not necessarily require ten schemata. It is possible to determine 
this - as Categorial Grammar does - in the lexical entry for the auxiliary or on an empty 
head (see Chapter 21 for a more general discussion of lexical and phrasal analyses). Re- 
gardless of this, every theory has to somehow account for these ten differences. If one 
wishes to argue that this has nothing to do with syntax, then somehow this has to be 
modelled in the semantic component. This means that there is no reason to prefer one 
theory over another at this point. 


4.6.5 Selection of atomic features vs. selection of complex categories 


Berwick & Epstein (1995) pointed out that Minimalist theories are very similar to Cate- 
gorial Grammar and I have discussed the similarities between Minimalist theories and 
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HPSG in Müller (2013c) and in the previous subsections. However, I overlooked one cru- 
cial difference between the usual assumptions about selection in Minimalist proposals 
on the one hand and Categorial Grammar, Dependency Grammar, LFG, HPSG, TAG, 
and Construction Grammar on the other hand: what is selected in the former type of 
theory is a single feature, while the latter theories select for feature bundles. This seems 
to be a small difference, but the consequences are rather severe. Stabler’s definition of 
External Merge that was given on page 165 removes the selection feature (=f) and the 
corresponding feature of the selected element (f). In some publications and in the in- 
troduction in this book, the selection features are called uninterpretable features and 
are marked with a u. The uninterpretable features have to be checked and then they 
are removed from the linguistic object as in Stabler’s definition. The fact that they have 
been checked is represented by striking them out. It is said that all uninterpretable fea- 
tures have to be checked before a syntactic object is send to the interfaces (semantics 
and pronunciation). If uninterpretable features are not checked, the derivation crashes. 
Adger (2003: Section 3.6) explicitly discusses the consequences of these assumptions: a 
selecting head checks a feature of the selected object. It is not possible to check features 
of elements that are contained in the object that a head combines with. Only features 
at the topmost node, the so-called root node, can be checked with external merge. The 
only way features inside complex objects can be checked is by means of movement. This 
means that a head may not combine with a partially saturated linguistic object, that is, 
with a linguistic object that has an unchecked selection feature. I will discuss this design 
decision with reference to an example provided by Adger (2003: 95). The noun letters 
selects for a P and Ps select for an N. The analysis of (59a) is depicted left in Figure 4.23. 


(59) a. letters to Peter 
b. * letters to 


N N 
letters[N, pl, #2] P letters[N, pl, #2] to[P, uN] 


[N 


to[P, 4N] Peter 
Figure 4.23: The analysis of letters to Peter according to Adger (2003: 95) 


The string in (59b) is ruled out since the uninterpretable N feature of the preposition to 
is not checked. So this integrates the constraint that all dependent elements have to be 
maximal into the core mechanism. This makes it impossible to analyze examples like 
(60) in the most straightforward way, namely as involving a complex preposition and a 
noun that is lacking a determiner: 


(60) vom Bus 
from.the bus 
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In theories in which complex descriptions can be used to describe dependants, the de- 
pendent may be partly saturated. So for instance in HPSG, fused prepositions like vom 
‘from.the’ can select an N, which is a nominal projection lacking a specifier: 


(61 N[spr (DET )] 


The description in (61) is an abbreviation for an internally structured set of feature-value 
pairs (see Section 9.1.1). The example here is given for the illustration of the differences 
only, since there may be ways of accounting for such cases in a single-feature-Merge 
system. For instance, one could assume a DP analysis and have the complex preposition 
select a complete NP (something of category N with no uninterpretable features). Al- 
ternatively, one can assume that there is indeed a full PP with all the structure that is 
usually assumed and the fusion of preposition and determiner happens during pronun- 
ciation. The first suggestion eliminates the option of assuming an NP analysis as it was 
suggested by Bruening (2009) in the Minimalist framework. 

Apart from this illustrative example with a fused preposition, there are other cases 
in which one may want to combine unsaturated linguistic objects. I already discussed 
coordination examples above. Another example is the verbal complex in languages like 
German, Dutch, and Japanese. Of course there are analyses of these languages that do 
not assume a verbal complex (G. Miiller 1998; Wurmbrand 2003a), but these are not 
without problems. Some of the problems were discussed in the previous section as well. 

Summing up this brief subsection, it has to be said that the feature checking mech- 
anism that is built into the conception of Merge is more restrictive than the selection 
that is used in Categorial Grammar, Lexical Functional Grammar, HPSG, Construction 
Grammar, and TAG. In my opinion, it is too restrictive. 


4.6.6 Summary 


In sum, one can say that the computational mechanisms of the Minimalist Program (e.g., 
transderivational constraints and labeling), as well as the theory of feature-driven move- 
ment are problematic and the assumption of empty functional categories is sometimes 
ad hoc. If one does not wish to assume that these categories are shared by all languages, 
then proposing two mechanisms (Merge and Move) does not represent a simplification 
of grammar since every single functional category which must be stipulated constitutes 
a complication of the entire system. 

The labeling mechanism is not yet worked out in detail, does not account for the 
phenomena it was claimed to provide accounts for, and hence should be replaced by the 
head/functor-based labeling that is used in Categorial Grammar and HPSG. 


4.7 Summary and classification 


This section is similar to Section 3.6. I first comment on language acquisition and then 
on formalization. 
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4.7.1 Explaining language acquisition 


Chomsky (2008: 135) counts theories in the MP as Principle & Parameter analyses and 
identifies MP parameters as being in the lexicon. Also, see Hornstein (2013: 396). UG is 
defined as possibly containing non-language-specific components, which are genetically 
determined (Chomsky 2007: 7). UG consists of unbounded Merge and the condition that 
expressions derived by a grammar must fulfill the restrictions imposed by the phonolog- 
ical and conceptual-intentional interfaces. In addition, a specific repertoire of features is 
assumed to be part of UG (Chomsky 2007: 6-7). The exact nature of these features has not 
been explained in detail and, as a result, the power of UG is somewhat vague. However, 
there is a fortunate convergence between various linguistic camps as Chomsky does not 
assume that the swathes of functional projections which we encountered in Section 4.6.1 
also form part of UG (however, authors like Cinque & Rizzi (2010) do assume that a hier- 
archy of functional projections is part of UG). Since there are still parameters, the same 
arguments used against GB approaches to language acquisition that were mentioned in 
Section 3.6.1 are still relevant for theories of language acquisition in the Minimalist Pro- 
gram. See Chapter 16 for an in-depth discussion of approaches to language acquisition 
and the Principles & Parameters model as well as input-based approaches. 

Chomsky’s main goal in the Minimalist Program is to simplify the theoretical assump- 
tions regarding formal properties of language and the computational mechanisms that 
are used so much as to make it plausible that they or relevant parts of them are part of 
our genetic endowment. But if we recapitulate what was assumed in this chapter, it is 
difficult to believe that Minimalist theories achieve this goal. To derive a simple sentence 
with an intransitive verb, one needs several empty heads and movements. Features can 
be strong or weak, Agree operates nonlocally in trees across several phrase boundaries. 
And in order to make correct predictions, it has to be made sure that Agree can only 
see the closest possible element (13)-(14). This is a huge machinery in comparison to a 
Categorial Grammar that just combines adjacent things. Categorial Grammars can be 
acquired from input (see Section 13.8.3), while it is really hard to imagine how the fact 
that there are features that trigger movement when they are strong, but do not trigger 
it when they are weak, should be acquired from data alone. 


4.7.2 Formalization 


Section 3.6.2 commented on the lack of formalization in transformational grammar up 
until the 1990s. The general attitude towards formalization did not change in the minimal- 
ist era and hence there are very few formalizations and implementations of Minimalist 
theories. 

Stabler (2001) shows how it is possible to formalize and implement Kayne’s theory 
of remnant movement. In Stabler’s implementation”, there are no transderivational 


His system is available at: http://linguistics.ucla.edu/people/stabler/coding.html. 2018-02-20. 
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constraints, no numerations*°, he does not assume Agree (see Fong 2014: 132) etc. The 
following is also true of Stabler’s implementation of Minimalist Grammars and GB sys- 
tems: there are no large grammars. Stabler’s grammars are small, meant as a proof of 
concept and purely syntactic. There is no morphology”, no treatment of multiple agree- 
ment (Stabler 2011b: Section 27.4.3) and above all no semantics. PF and LF processes are 
not modelled." The grammars and the computational system developed by Sandiway 
Fong are of similar size and faithfulness to the theory (Fong & Ginsburg 2012; Fong 2014): 
the grammar fragments are small, encode syntactic aspects such as labeling directly in 
the phrase structure (Fong & Ginsburg 2012: Section 4) and therefore, fall behind X the- 
ory. Furthermore, they do not contain any morphology. Spell-Out is not implemented, 
so in the end it is not possible to parse or generate any utterances." Herring’s (2016) 
dissertation is a promising beginning. Herring developed a system that can be used for 


#0 There is a numeration lexicon in Veenstra (1998: Chapter 9). This lexicon consists of a set of numerations, 
which contain functional heads, which can be used in sentences of a certain kind. For example, Veenstra 
assumes numerations for sentences with bivalent verbs and subjects in initial position, for embeded sen- 
tences with monovalent verbs, for wh-questions with monovalent verbs, and for polar interrogatives with 
monovalent verbs. An element from this set of numerations corresponds to a particular configuration and 
a phrasal construction in the spirit of Construction Grammar. Veenstra's analysis is not a formalization 
of the concept of the numeration that one finds in Minimalist works. Normally, it is assumed that a nu- 
meration contains all the lexical entries which are needed for the derivation of a sentence. As (i) shows, 
complex sentences can consist of combinations of sentences with various different sentence types: 


(i) Der Mann, der behauptet hat, dass Maria gelacht hat, steht neben derPalme, die ` (mm letzten 
the man whoclaimed has that Maria laughed has stands next.to the palm.tree which in last 
Jahr gepflanzt wurde. 
year planted was 


"Ihe man who claimed Maria laughed is standing next to the palm tree that was planted last year. 


In (i), there are two relative clauses with verbs of differing valence, an embedded sentence with a monova- 
lent verb and the matrix clause. Under a traditional understanding of numerations, Veenstra would have 
to assume an infinite numeration lexicon containing all possible combinations of sentence types. 

The test sentences have the form as in (i). 


(i) 


the king will -s eat 


a. 
b. the king have -s eat -en 


p 


the king be -s eat -ing 
d. the king -s will -s have been eat -ing the pie 


42 See Sauerland & Elbourne (2002) for suggestions of PF and LF-movement and the deletion of parts of copies 
(p. 285). The implementation of this would be far from trivial. 

The claim by Berwick, Pietroski, Yankama & Chomsky (2011: 1221) in reference to Fong’s work is just plain 
wrong: But since we have sometimes adverted to computational considerations, as with the ability to “check” 
features of a head/label, this raises a legitimate concern about whether our framework is computationally 
realizable. So it is worth noting that the copy conception of movement, along with the locally oriented “search 
and labeling” procedure described above, can be implemented computationally as an efficient parser; see Fong, 
2011, for details. If one has a piece of software which cannot parse a single sentence, then one cannot 
claim that it is efficient since one does not know whether the missing parts of the program could make it 
extremely inefficient. Furthermore, one cannot compare the software to other programs. As has already 
been discussed, labeling is not carried out by Fong as was described in Chomsky’s work, but instead he 
uses a phrase structure grammar of the kind described in Chapter 2. 
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4.7 Summary and classification 


grammar development in 